scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 23:43:31 +00:00

Author	SHA1	Message	Date
Botond Dénes	d309a86708	Merge 'Add keyspace_offstrategy_compaction api' from Benny Halevy This series adds methods to perform offstrategy compaction, if needed, returning a future<bool> so the caller can wait on it until compaction completes. The returned value is true iff offstrategy compaction was needed. The added keyspace_offstrategy_compaction calls perform_offstrategy_compaction on the specified keyspace and tables, return the number of tables that required offstrategy compaction. A respective unit test was added to the rest_api pytest. This PR replaces https://github.com/scylladb/scylla/pull/9095 that suggested adding an option to `keyspace_compaction` since offstrategy compaction triggering logic is different enough from major compaction meriting a new api. Test: unit (dev) Closes #9980 * github.com:scylladb/scylla: test: rest_api: add unit tests for keyspace_offstrategy_compaction api api: add keyspace_offstrategy_compaction compaction_manager: get rid of submit_offstrategy table: add perform_offstrategy_compaction compaction_manager: perform_offstrategy: print ks.cf in log messages compaction_manager: allow waiting on offstrategy compaction	2022-02-02 13:15:31 +02:00
Tomasz Grabiec	00a9326ae7	Merge "raft: let `modify_config` finish on a follower that removes itself" from Kamil When forwarding a reconfiguration request from follower to a leader in `modify_config`, there is no reason to wait for the follower's commit index to be updated. The only useful information is that the leader committed the configuration change - so `modify_config` should return as soon as we know that. There is a reason not to wait for the follower's commit index to be updated: if the configuration change removes the follower, the follower will never learn about it, so a local waiter will never be resolved. `execute_modify_config` - the part of `modify_config` executed on the leader - is thus modified to finish when the configuration change is fully complete (including the dummy entry appended at the end), and `modify_config` - which does the forwarding - no longer creates a local waiter, but returns as soon as the RPC call to the leader confirms that the entry was committed on the leader. We still return an `entry_id` from `execute_modify_config` but that's just an artifact of the implementation. Fixes #9981. A regression test was also added in randomized_nemesis_test. * kbr/modify-config-finishes-v1: test: raft: randomized_nemesis_test: regression test for #9981 raft: server: don't create local waiter in `modify_config`	2022-01-31 20:14:50 +01:00
Nadav Har'El	8a745593a2	Merge 'alternator: fill UnprocessedKeys for failed batch reads' from Piotr Sarna DynamoDB protocol specifies that when getting items in a batch failed only partially, unprocessed keys can be returned so that the user can perform a retry. Alternator used to fail the whole request if any of the reads failed, but right now it instead produces the list of unprocessed keys and returns them to the user, as long as at least 1 read was successful. This series comes with a test based on Scylla's error injection mechanism, and thus is only useful in modes which come with error injection compiled in. In release mode, expect to see the following message: SKIPPED (Error injection not enabled in Scylla - try compiling in dev/debug/sanitize mode) Fixes #9984 Closes #9986 * github.com:scylladb/scylla: test: add total failure case for GetBatchItem test: add error injection case for GetBatchItem test: add a context manager for error injection to alternator alternator: add error injection to BatchGetItem alternator: fill UnprocessedKeys for failed batch reads	2022-01-31 15:28:24 +02:00
Piotr Sarna	c87126198d	test: add total failure case for GetBatchItem The test verifies that if all reads from a batch operation failed, the result is an error, and not a success response with UnprocessedKeys parameter set to all keys.	2022-01-31 14:21:55 +01:00
Piotr Sarna	e79c2943fc	test: add error injection case for GetBatchItem The new test case is based on Scylla error injection mechanism and forces a partial read by failing some requests from the batch.	2022-01-31 14:21:55 +01:00
Piotr Sarna	99c5bec0e2	test: add a context manager for error injection to alternator With the new context manager it's now easier to request an error to be injected via REST API. Note that error injection is only enabled in certain build modes (dev, debug, sanitize) and the test case will be skipped if it's not possible to use this mechanism.	2022-01-31 14:21:55 +01:00
Tomasz Grabiec	8297ae531d	Merge "Automatically retry CQL DDL statements in presence of concurrent changes" from Kamil Schema changes on top of Raft do not allow concurrent changes. If two changes are attempted concurrently, one of them gets `group0_concurrent_modification` exception. Catch the exception in CQL DDL statement execution function and retry. In addition, improve the description of CQL DDL statements in group 0 history table. Add a test which checks that group 0 history grows iff a schema change does not throw `group0_concurrent_modification`. Also check that the retry mechanism works as expected. * kbr/ddl-retry-v1: test: unit test for group 0 concurrent change protection and CQL DDL retries cql3: statements: schema_altering_statement: automatically retry in presence of concurrent changes	2022-01-31 14:12:35 +01:00
Tomasz Grabiec	b78bab7286	Merge "raft: fixes and improvements to the library and nemesis test" from Kamil Raft randomized nemesis test was improved by adding some more chaos: randomizing the network delay, server configuration, ticking speed of servers. This allowed to catch a serious bug, which is fixed in the first patch. The patchset also fixes bugs in the test itself and adds quality of life improvements such as better diagnostics when inconsistency is detected. * kbr/nemesis-random-v1: test: raft: randomized_nemesis_test: print state of each state machine when detecting inconsistency test: raft: randomized_nemesis_test: print details when detecting inconsistency test: raft: randomized_nemesis_test: print snapshot details when taking/loading snapshots in `impure_state_machine` test: raft: randomized_nemesis_test: keep server id in impure_state_machine test: raft: randomized_nemesis_test: frequent snapshotting configuration test: raft: randomized_nemesis_test: tick servers at different speeds in generator test test: raft: randomized_nemesis_test: simplify ticker test: raft: randomized_nemesis_test: randomize network delay test: raft: randomized_nemesis_test: fix use-after-free in `environment::crash()` test: raft: randomized_nemesis_test: fix use-after-free in two-way rpc functions test: raft: randomized_nemesis_test: rpc: don't propagate `gate_closed_exception` outside test: raft: randomized_nemesis_test: fix obsolete comment raft: fsm: print configuration entries appearing in the log raft: `operator<<(ostream&, ...)` implementation for `server_address` and `configuration` raft: server: abort snapshot applications before waiting for rpc abort raft: server: logging fix raft: fsm: don't advance commit index beyond matched entries	2022-01-31 13:25:27 +01:00
Mikołaj Sielużycki	93d6eb6d51	compacting_reader: Support fast_forward_to position range. Fast forwarding is delegated to the underlying reader and assumes the it's supported. The only corner case requiring special handling that has shown up in the tests is producing partition start mutation in the forwarding case if there are no other fragments. compacting state keeps track of uncompacted partition start, but doesn't emit it by default. If end of stream is reached without producing a mutation fragment, partition start is not emitted. This is invalid behaviour in the forwarding case, so I've added a public method to compacting state to force marking partition as non-empty. I don't like this solution, as it feels like breaking an abstraction, but I didn't come across a better idea. Tests: unit(dev, debug, release) Message-Id: <20220128131021.93743-1-mikolaj.sieluzycki@scylladb.com>	2022-01-31 13:37:36 +02:00
Nadav Har'El	a25e265373	test/alternator: improve comment on why we need "global_random" Improve the comment that explains why we needed to use an explicitly shared random sequence instead of the usual "random". We now understand that we need this workaround to undo what the pytest-randomly plugin does. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220130155557.1181345-1-nyh@scylladb.com>	2022-01-31 10:07:56 +01:00
Nadav Har'El	59fe6a402c	test/cql-pytest: use unique keys instead of random keys Some of the tests in test/cql-pytest share the same table but use different keys to ensure they don't collide. Before this patch we used a random key, which was usually fine, but we recently noticed that the pytest-randomly plugin may cause different tests to run through the same sequence of random numbers and ruin our intent that different tests use different keys. So instead of using a random key, let's use a unique key. We can achieve this uniqueness trivially - using a counter variable - because anyway the uniqueness is only needed inside a single temporary table - which is different in every run. Another benefit is that it will now be clearer that the tests are deterministic and not random - the intent of a random_string() key was never to randomly walk the entire key space (random_string() anyway had a pretty narrow idea of what a random string looks like) - it was just to get a unique key. Refs #9988 (fixes it for cql-pytest, but not for test/alternator) Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-01-31 09:01:23 +02:00
Benny Halevy	1c25934399	test: rest_api: add unit tests for keyspace_offstrategy_compaction api Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-01-30 20:40:40 +02:00
Tomasz Grabiec	b734615f51	util: cached_file: Fix corruption after memory reclamation was triggered from population If memory reclamation is triggered inside _cache.emplace(), the _cache btree can get corrupted. Reclaimers erase from it, and emplace() assumes that the tree is not modified during its execution. It first locates the target node and then does memory allocation. Fix by running emplace() under allocating section, which disables memory reclamation. The bug manifests with assert failures, e.g: ./utils/bptree.hh:1699: void bplus::node<unsigned long, cached_file::cached_page, cached_file::page_idx_less_comparator, 12, bplus::key_search::linear, bplus::with_debug::no>::refill(Less) [Key = unsigned long, T = cached_file::cached_page, Less = cached_file::page_idx_less_comparator, NodeSize = 12, Search = bplus::key_search::linear, Debug = bplus::with_debug::no]: Assertion `p._kids[i].n == this' failed. Fixes #9915 Message-Id: <20220130175639.15258-1-tgrabiec@scylladb.com>	2022-01-30 19:57:35 +02:00
Piotr Sarna	471205bdcf	test/alternator: use a global random generator for all test cases It was observed (perhaps it depends on the Python implementation) that an identical seed was used for multiple test cases, which violated the assumption that generated values are in fact unique. Using a global generator instead makes sure that it was only seeded once. Tests: unit(dev) # alternator tests used to fail for me locally before this patch was applied Message-Id: <315d372b4363f449d04b57f7a7d701dcb9a6160a.1643365856.git.sarna@scylladb.com>	2022-01-30 16:40:20 +02:00
Kamil Braun	d10b508380	test: raft: randomized_nemesis_test: regression test for #9981	2022-01-27 17:50:40 +01:00
Kamil Braun	4a52b802ac	test: unit test for group 0 concurrent change protection and CQL DDL retries Check that group 0 history grows iff a schema change does not throw `group0_concurrent_modification`. Check that the CQL DDL statement retry mechanism works as expected.	2022-01-27 11:26:15 +01:00
Tomasz Grabiec	ba6c02b38a	Merge "Clear old entries from group 0 history when performing schema changes" from Kamil When performing a change through group 0 (which right now means schema changes), clear entries from group 0 history table which are older than one week. This is done by including an appropriate range tombstone in the group 0 history table mutation. * kbr/g0-history-gc-v2: idl: group0_state_machine: fix license blurb test: unit test for clearing old entries in group0 history service: migration_manager: clear old entries from group 0 history when announcing	2022-01-26 16:12:40 +01:00
Kamil Braun	95ac8ead4f	test: raft: randomized_nemesis_test: print state of each state machine when detecting inconsistency	2022-01-26 16:09:41 +01:00
Kamil Braun	e249ea5aef	test: raft: randomized_nemesis_test: print details when detecting inconsistency If the returned result is inconsistent with the constructed model, print the differences in detail instead of just failing an assertion.	2022-01-26 16:09:41 +01:00
Kamil Braun	1170e47af4	test: raft: randomized_nemesis_test: print snapshot details when taking/loading snapshots in `impure_state_machine` Useful for debugging.	2022-01-26 16:09:41 +01:00
Kamil Braun	b8158e0b43	test: raft: randomized_nemesis_test: keep server id in impure_state_machine Will be used for logging.	2022-01-26 16:09:41 +01:00
Kamil Braun	3c01449472	test: raft: randomized_nemesis_test: frequent snapshotting configuration With probability 1/2, run the test with a configuration that causes servers to take snapshots frequently.	2022-01-26 16:09:41 +01:00
Kamil Braun	7546a9ebb5	test: raft: randomized_nemesis_test: tick servers at different speeds in generator test Previously all servers were ticked at the same moment, every 10 network/timer ticks. Now we tick each server with probability 1/10 on each network/timer tick. Thus, on average, every server is ticked once per 10 ticks. But now we're able to obtain more interesting behaviors. E.g. we can now observe servers which are stalling for as long as 10 ticks and servers which temporarily speed up to tick once per each network tick.	2022-01-26 16:09:41 +01:00
Kamil Braun	5d986b2682	test: raft: randomized_nemesis_test: simplify ticker Instead of taking a set of functions with different periods, take a single function that is called on every tick. The periodicity can be implemented easily on the user side.	2022-01-26 16:09:41 +01:00
Kamil Braun	173fb2bf36	test: raft: randomized_nemesis_test: randomize network delay As a side effect, this causes messages to be delivered in a different order they were sent, adding even more chaos.	2022-01-26 16:09:41 +01:00
Kamil Braun	00c18adbb0	test: raft: randomized_nemesis_test: fix use-after-free in `environment::crash()` The lambda attached to `_crash_fiber` was a coroutine. The coroutine would use `this` captured by the lambda after the `co_await`, where the lambda object (hence its captures) was already destroyed. No idea why it worked before and sanitizers did not complain in debug mode.	2022-01-26 16:09:41 +01:00
Kamil Braun	4c68e6a04c	test: raft: randomized_nemesis_test: fix use-after-free in two-way rpc functions Two-way RPC functions such as `send_snapshot` had a guard object which was captured in a lambda passed to `with_gate`. The guard object, on destruction, accessed the `rpc` object. Unfortunately, the guard object could outlive the `rpc` object. That's because the lambda, and hence the guard object, was destroyed after `with_gate` finished (it lived in the frame of the caller of `with_gate`, i.e. `send_snapshot` and others), so it could be destroyed after `rpc` (the gate prevents `rpc` from being destroyed). Make sure that the guard object is destroyed before `with_gate` finishes by creating it inside the lambda body - not capturing inside the object.	2022-01-26 16:09:41 +01:00
Kamil Braun	871f0d00ce	test: raft: randomized_nemesis_test: rpc: don't propagate `gate_closed_exception` outside The `raft::rpc` interface functions are called by `raft::server_impl` and the exceptions may be propagated outside the server, e.g. through the `add_entry` API. Translate the internal `gate_closed_exception` to an external `raft::stopped_error`.	2022-01-26 16:09:41 +01:00
Kamil Braun	9da4ffc1c7	test: raft: randomized_nemesis_test: fix obsolete comment	2022-01-26 16:09:41 +01:00
Kamil Braun	44a1a8a8b0	raft: `operator<<(ostream&, ...)` implementation for `server_address` and `configuration` Useful for debugging. Had to make `configuration` constructor explicit. Otherwise the `operator<<` implementation for `configuration` would implicitly convert the `server_address` to `configuration` when trying to output it, causing infinite recursion. Removed implicit uses of the constructor.	2022-01-26 16:09:41 +01:00
Gleb Natapov	579dcf187a	raft: allow an option to persist commit index Raft does not need to persist the commit index since a restarted node will either learn it from an append message from a leader or (if entire cluster is restarted and hence there is no leader) new leader will figure it out after contacting a quorum. But some users may want to be able to bring their local state machine to a state as up-to-date as it was before restart as soon as possible without any external communication. For them this patch introduces new persistence API that allows saving and restoring last seen committed index. Message-Id: <YfFD53oS2j1My0p/@scylladb.com>	2022-01-26 14:06:39 +01:00
Kamil Braun	b863a63b08	test: unit test for clearing old entries in group0 history We perform a bunch of schema changes with different values of `migration_manager::_group0_history_gc_duration` and check if entries are cleared according to this setting.	2022-01-25 13:13:35 +01:00
Botond Dénes	eb42213db4	compact_mutation: close active range tombstone on page end The compactor recently acquired the ability to consume a v2 stream. The v2 spec requires that all streams end with a null tombstone. `range_tombstone_assembler`, the component the compactor uses for converting the v2 input into its v1 output enforces this with a check on `consume_end_of_partition()`. Normally the producer of the stream the compactor is consuming takes care of closing the active tombstone before the stream ends. The compactor however (or its consumer) can decide to end the consume early, e.g. to cut the current page. When this happens the compactor must take care of closing the tombstone itself. Furthermore it has to keep this tombstone around to re-open it on the next page. This patch implements this mechanism which was left out of `134601a15e`. It also adds a unit test which reproduces the problems caused by the missing mechanism. The compactor now tracks the last clustering position emitted. When the page ends, this position will be used as the position of the closing range tombstone change. This ensures the range tombstone only covers the actually emitted range. Fixes: #9907 Tests: unit(dev), dtest(paging_test.py, paging_additional_test.py) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20220114053215.481860-1-bdenes@scylladb.com>	2022-01-25 09:52:30 +02:00
Kamil Braun	a664ac7ba5	treewide: require `group0_guard` when performing schema changes `announce` now takes a `group0_guard` by value. `group0_guard` can only be obtained through `migration_manager::start_group0_operation` and moved, it cannot be constructed outside `migration_manager`. The guard will be a method of ensuring linearizability for group 0 operations.	2022-01-24 15:20:35 +01:00
Kamil Braun	283ac7fefe	treewide: pass mutation timestamp from call sites into `migration_manager::prepare_*` functions The functions which prepare schema change mutations (such as `prepare_new_column_family_announcement`) would use internally generated timestamps for these mutations. When schema changes are managed by group 0 we want to ensure that timestamps of mutations applied through Raft are monotonic. We will generate these timestamps at call sites and pass them into the `prepare_` functions. This commit prepares the APIs.	2022-01-24 15:12:50 +01:00
Benny Halevy	188cedd533	test: lister_test: test_lister_abort: generate at least one entry Without this fix, generate_random_content could generate 0 entries and the expected exception would never be injected. With it, we generate at least 1 entry and the test passes with the offending random-seed: ``` random-seed=1898914316 Generated 1 dir entries Aborting lister after 1 dir entries test/boost/lister_test.cc(96): info: check 'exception "expected_exception" raised as expected' has passed ``` Fixes #9953 Test: lister_test.test_lister_abort --random-seed=1898914316(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220123122921.14017-1-bhalevy@scylladb.com>	2022-01-23 17:52:44 +02:00
Benny Halevy	f439edca35	test: sstable_compaction_test: twcs_reshape_with_disjoint_set_test: take min_threshold into consideration Take into account that get_reshaping_job selects only buckets that have more than min_threashold sstables in them. Therefore, with 256 disjoint sstables in different windows, allow first or last windows to not be selected by get_reshaping_job that will return at least disjoint_sstable_count - min_threshold + 1 sstables, and not more than disjoint_sstable_count. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220123090044.38449-2-bhalevy@scylladb.com>	2022-01-23 17:52:44 +02:00
Piotr Jastrzebski	09d4438a0d	cdc: Handle compact storage correctly in preimage Base tables that use compact storage may have a special artificial column that has an empty type. `c010cefc4d` fixed the main CDC path to handle such columns correctly and to not include them in the CDC Log schema. This patch makes sure that generation of preimage ignores such empty column as well. Fixes #9876 Closes #9910 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2022-01-20 13:23:38 +01:00
Nadav Har'El	7cb6250c40	Merge 'snapshot_ctl: true_snapshots_size: fix space accounting' from Benny Halevy This pull request fixes two preexisting issues related to snapshot_ctl::true_snapshots_size https://github.com/scylladb/scylla/issues/9897 https://github.com/scylladb/scylla/issues/9898 And adds a couple unit tests to tests the snapshot_ctl functionality. Test: unit(dev), database_test.{test_snapshot_ctl_details,test_snapshot_ctl_true_snapshots_size}(debug) Closes #9899 * github.com:scylladb/scylla: table: get_snapshot_details: count allocated_size snapshot_ctl: cleanup true_snapshots_size snpashot_ctl: true_snapshots_size: do not map_reduce across all shards	2022-01-19 11:57:15 +02:00
Benny Halevy	5db3cbe1e4	snpashot_ctl: true_snapshots_size: do not map_reduce across all shards snapshot_ctl uses map_reduce over all database shards, each counting the size of the snapshots directory, which is shared, not per-shard. So the total live size returned by it is multiples by the number of shards. Add a unit test to test that. Fixes #9897 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-01-19 07:50:53 +02:00
Nadav Har'El	1ce73c2ab3	Merge 'utils::is_timeout_exception: Ensure we handle nested exception types' from Calle Wilund Fixes #9922 storage proxy uses is_timeout_exception to traverse different code paths. `a6202ae079` broke this (because bit rot and intermixing), by wrapping exception for information purposes. This adds check of nested types in exception handling, as well as a test for the routine itself. Closes #9932 * github.com:scylladb/scylla: database/storage_proxy: Use "is_timeout_exception" instead of catch match utils::is_timeout_exception: Ensure we handle nested exception types	2022-01-18 23:49:41 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Raphael S. Carvalho	299ffb1e1a	compaction: make TWCS reshape on a time bucket with tons of files much more efficient Currently, when TWCS reshape finds a bucket containing more than 32 files, it will blindly resize that bucket to 32. That's very bad because it doesn't take into consideration that compaction efficiency depends on relative sizes of files being compacted together, meaning that a huge file can be compacted with a tiny one, producing lots of write amplification. To solve this problem, STCS reshape logic will now be reused in each time bucket. So only similar-sized files are compacted together and the time bucket will be considered reshaped once its size tiers are properly compacted, according to the reshape mode. Fixes #9938. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220117205000.121614-1-raphaelsc@scylladb.com>	2022-01-18 12:33:54 +02:00
Avi Kivity	7260d8abed	Merge "index_reader: improve verify_end_state()" from Botond " Said method should take care of checking that parsing stopped in a valid state. This patch-set expands the existing but very lacking implementation by improving the existing error message and adding an additional check for prematurely exiting the parser in the middle of parsing an index entry, something we've seen recently in #9446. To help in debugging such issues, some additional information is added to the trace messages. The series also fixes a bug in the error handling code of the partition index cache. Refs: #9446 Tests: unit(dev) " * 'index-reader-better-verify-end-state/v2.1' of https://github.com/denesb/scylla: sstables/index_reader: process_state(): add additional information to trace logging sstables/index_reader: verify_end_state(): add check for premature EOS sstables/index_reader: convert exception in verify_end_state() to malformed sstable exception sstables/index_reader: add const sstable& to index_consume_entry_context sstables/index_reader: remove unused members from index_consume_entry_context	2022-01-18 12:13:08 +02:00
Botond Dénes	afb14508c4	sstables/index_reader: verify_end_state(): add check for premature EOS Add a check which ensures that parsing ended in a valid state and not in the middle of a half-parsed entry.	2022-01-18 10:38:11 +02:00
Botond Dénes	518abe7415	test/lib/mutation_diff: force textual conversion If the compared mutations have binary keys, `colordiff` will declare the file as binary and will refuse to compare them, beyond a very unhelpful "binary files differ" summary. Add "-a" to the command line to force a treating all files as text. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20220117131347.106585-1-bdenes@scylladb.com>	2022-01-17 15:27:53 +02:00
Benny Halevy	25977db7b4	token_metadata: remove update_normal_token entry point It's currently used only by unit tests and it is dangerous to use on a populated token_metadata as update_normal_tokens assumes that the set of tokens owned by the given endpoint is compelte, i.e. previous tokens owned by the endpoint are no longer owned by it, but the single-token update_normal_token interface seems commulative (and has no documentation whatsoever). It is better to remove this interface and calculate a complete map of endpoint->tokens from the tests. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220117101242.122512-1-bhalevy@scylladb.com>	2022-01-17 12:18:42 +02:00
Nadav Har'El	8fd5041092	cql: INSERT JSON should refuse empty-string partition key Add the missing partition-key validation in INSERT JSON statements. Scylla, following the lead of Cassandra, forbids an empty-string partition key (please note that this is not the same as a null partition key, and that null clustering keys are allowed). Trying to INSERT, UPDATE or DELETE a partition with an empty string as the partition key fails with a "Key may not be empty". However, we had a loophole - you could insert such empty-string partition keys using an "INSERT ... JSON" statement. The problem was that the partition key validation was done in one place - `modification_statement::build_partition_keys()`. The INSERT, UPDATE and DELETE statements all inherited this same method and got the correct validation. But the INSERT JSON statement - insert_prepared_json_statement overrode the build_partition_keys() method and this override forgot to call the validation function. So in this patch we add the missing validation. Note that the validation function checks for more than just empty strings - there is also a length limit for partition keys. This patch also adds a cql-pytest reproducer for this bug. Before this patch, the test passed on Cassandra but failed on Scylla. Reported by @FortTell Fixes #9853. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220116085216.21774-1-nyh@scylladb.com>	2022-01-17 09:46:18 +01:00
Calle Wilund	97bb1be6f7	utils::is_timeout_exception: Ensure we handle nested exception types Fixes #9922 storage proxy uses is_timeout_exception to traverse different code paths. `a6202ae079` broke this (because bit rot and intermixing), by wrapping exception for information purposes. This adds check of nested types in exception handling, as well as a test for the routine itself.	2022-01-17 08:43:41 +00:00
Nadav Har'El	82005b91b6	test/cql-pytest: really flush() in translated Cassandra tests Some of the CQL tests translated from Cassandra into the test/cql-pytest framework used the flush() function to force a flush to sstables - presumably because this exercised yet another code path, or because it reproduced bugs that Cassandra once had that were only visible when reading from sstables - not from memtables. Until now, this flush() function was stubbed and did nothing. But we do have in test/cql-pytest a flush() implementation in nodetool.py - which uses the REST API if possible and if not (e.g., when running against Cassandra) uses the external "nodetool" command. So in this patch flush() starts to use nodetool.flush() instead of doing nothing. The tests continue to pass as before after this patch, and there is no noticable slowdown (the flush does take time, but the few times it's done is negligible in these tests). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220117073112.83994-1-nyh@scylladb.com>	2022-01-17 10:22:04 +02:00

1 2 3 4 5 ...

2705 Commits