scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 19:46:48 +00:00

Author	SHA1	Message	Date
Nadav Har'El	07c20bdfea	materialized view: fix bug in some large modifications to base partitions Sometimes a single modification to a base partition requires updates to a large number of view rows. A common example is deletion of a base partition containing many rows. A large BATCH is also possible. To avoid large allocations, we split the large amount of work into batch of 100 (max_rows_for_view_updates) rows each. The existing code assumed an empty result from one of these batches meant that we are done. But this assumption was incorrect: There are several cases when a base-table update may not need a view update to be generated (see can_skip_view_updates()) so if all 100 rows in a batch were skipped, the view update stopped prematurely. This patch includes two tests showing when this bug can happen - one test using a partition deletion with a USING TIMESTAMP causing the deletion to not affect the first 100 rows, and a second test using a specially-crafed large BATCH. These use cases are fairly esoteric, but in fact hit a user in the wild, which led to the discovery of this bug. The fix is fairly simple: To detect when build_some() is done it is no longer enough to check if it returned zero view-update rows; Rather, it explicitly returns whether or not it is done as an std::optional. The patch includes several tests for this bug, which pass on Cassandra, failed on Scylla before this patch, and pass with this patch. Fixes #12297. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12305 (cherry picked from commit `92d03be37b`)	2023-01-04 11:36:39 +02:00
Botond Dénes	8a36c4be54	evicatble_reader: avoid preemption pitfall around waiting for readmission Permits have to wait for re-admission after having been evicted. This happens via `reader_permit::maybe_wait_readmission()`. The user of this method -- the evictable reader -- uses it to re-wait admission when the underlying reader was evicted. There is one tricky scenario however, when the underlying reader is created for the first time. When the evictable reader is part of a multishard query stack, the created reader might in fact be a resumed, saved one. These readers are kept in an inactive state until actually resumed. The evictable reader shares it permit with the to-be-resumed reader so it can check whether it has been evicted while saved and needs to wait readmission before being resumed. In this flow it is critical that there is no preemption point between this check and actually resuming the reader, because if there is, the reader might end up actually recreated, without having waited for readmission first. To help avoid this situation, the existing `maybe_wait_readmission()` is split into two methods: * `bool reader_permit::needs_readmission()` * `future<> reader_permit::wait_for_readmission()` The evictable reader can now ensure there is no preemption point between `needs_readmission()` and resuming the reader. Fixes: #10187 Tests: unit(release) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20220315105851.170364-1-bdenes@scylladb.com> (cherry picked from commit `61028ad718`)	2023-01-04 11:20:28 +02:00
Avi Kivity	bf92c2b44c	Merge 'reader_concurrency_semaphore: fix waiter/inactive race' from Botond Dénes We recently (in `7fbad8de87`) made sure all admission paths can trigger the eviction of inactive reads. As reader eviction happens in the background, a mechanism was added to make sure only a single eviction fiber was running at any given time. This mechanism however had a preemption point between stopping the fiber and releasing the evict lock. This gave an opportunity for either new waiters or inactive readers to be added, without the fiber acting on it. Since it still held onto the lock, it also prevented from other eviction fibers to start. This could create a situation where the semaphore could admit new reads by evicting inactive ones, but it still has waiters. Since an empty waitlist is also an admission criteria, once one waiter is wrongly added, many more can accumulate. This series fixes this by ensuring the lock is released in the instant the fiber decides there is no more work to do. It also fixes the assert failure on recursive eviction and adds a detection to the inactive/waiter contradiction. Fixes: #11923 Refs: #11770 Closes #12026 * github.com:scylladb/scylladb: reader_concurrency_semaphore: do_wait_admission(): detect admission-waiter anomaly reader_concurrency_semaphore: evict_readers_in_the_background(): eliminate blind spot reader_concurrency_semaphore: do_detach_inactive_read(): do a complete detach (cherry picked from commit `15ee8cfc05`)	2023-01-03 16:46:44 +02:00
Botond Dénes	0e388d2140	reader_concurrency_semaphore: unify admission logic across all paths The semaphore currently has two admission paths: the obtain_permit()/with_permit() methods which admits permits on user request (the front door) and the maybe_admit_waiters() which admits permits based on internal events like memory resource being returned (the back door). The two paths used their own admission conditions and naturally this means that they diverged in time. Notably, maybe_admit_waiters() did not look at inactive readers assuming that if there are waiters there cannot be inactive readers. This is not true however since we merged the execution-stage into the semaphore. Waiters can queue up even when there are inactive reads and thus maybe_admit_waiters() has to consider evicting some of them to see if this would allow for admitting new reads. To avoid such divergence in the future, the admission logic was moved into a new method can_admit_read() which is now shared between the two method families. This method now checks for the possibility of evicting inactive readers as well. The admission logic was tuned slightly to only consider evicting inactive readers if there is a real possibility that this will result in admissions: notably, before this patch, resource availability was checked before stalls were (used permits == blocked permits), so we could evict readers even if this couldn't help. Because now eviction can be started from maybe_admit_waiters(), which is also downstream from eviction, we added a flag to avoid recursive evict -> maybe admit -> evict ... loops. Fixes: #11770 Closes #11784 (cherry picked from commit `7fbad8de87`)	2023-01-03 16:46:30 +02:00
Botond Dénes	288eb9d231	Merge 'Backport 5.0: cleanup compaction: flush memtable' from Benny Halevy This a backport of `9fa1783892` (#11902) to branch-5.0 Flush the memtable before cleaning up the table so not to leave any disowned tokens in the memtable as they might be resurrected if left in the memtable. Refs #1239 Closes #12415 * github.com:scylladb/scylladb: table: perform_cleanup_compaction: flush memtable table: add perform_cleanup_compaction api: storage_service: add logging for compaction operations et al	2023-01-03 12:23:03 +02:00
Benny Halevy	9219a59802	table: perform_cleanup_compaction: flush memtable We don't explicitly cleanup the memtable, while it might hold tokens disowned by the current node. Flush the memtable before performing cleanup compaction to make sure all tokens in the memtable are cleaned up. Note that non-owned ranges are invalidate in the cache in compaction_group::update_main_sstable_list_on_compaction_completion using desc.ranges_for_cache_invalidation. Fixes #1239 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from `eb3a94e2bc`)	2022-12-29 09:36:37 +02:00
Benny Halevy	f9cea4dc51	table: add perform_cleanup_compaction Move the integration with compaction_manager from the api layer to the tabel class so it can also make sure the memtable is cleaned up in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from `fc278be6c4`)	2022-12-29 09:36:37 +02:00
Benny Halevy	081b2b76cc	api: storage_service: add logging for compaction operations et al Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from `85523c45c0`)	2022-12-29 09:36:20 +02:00
Anna Mikhlin	dfb229a18a	release: prepare for 5.0.9	2022-12-29 09:25:47 +02:00
Takuya ASADA	60da855c2d	scylla_setup: fix incorrect type definition on --online-discard option --online-discard option defined as string parameter since it doesn't specify "action=", but has default value in boolean (default=True). It breaks "provisioning in a similar environment" since the code supposed boolean value should be "action='store_true'" but it's not. We should change the type of the option to int, and also specify "choices=[0, 1]" just like --io-setup does. Fixes #11700 Closes #11831 (cherry picked from commit `acc408c976`)	2022-12-28 20:44:12 +02:00
Benny Halevy	1718861e94	main: shutdown: do not abort on storage_io_error Do not abort in defer_verbose_shutdown if the callback throws storage_io_error, similar and in addition to the system errors handling that was added in `132c9d5933` As seen in https://github.com/scylladb/scylla/issues/9573#issuecomment-1148238291 Fixes #9573 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10740 (cherry picked from commit `1daa7820c9`)	2022-12-28 19:29:17 +02:00
Petr Gusev	e03e9b1abe	cql: batch statement, inserting a row with a null key column should be forbidden Regular INSERT statements with null values for primary key components are rejected by Scylla since #9286 and #9314. Batch statements missed a similar check, this patch fixes it. Fixes: #12060 (cherry picked from commit `7730c4718e`)	2022-12-28 18:15:54 +02:00
Nadav Har'El	14077d2def	murmur3: fix inconsistent token for empty partition key Traditionally in Scylla and in Cassandra, an empty partition key is mapped to minimum_token() instead of the empty key's usual hash function (0). The reasons for this are unknown (to me), but one possibility is that having one known key that maps to the minimal token is useful for various iterations. In murmur3_partitioner.cc we have two variants of the token calculation function - the first is get_token(bytes_view) and the second is get_token(schema, partition_key_view). The first includes that empty- key special case, but the second was missing this special case! As Kamil first noted in #9352, the second variant is used when looking up partitions in the index file - so if a partition with an empty-string key is saved under one token, it will be looked up under a different token and not found. I reproduced exactly this problem when fixing issues #9364 and #9375 (empty-string keys in materialized views and indexes) - where a partition with an empty key was visible in a full-table scan but couldn't be found by looking up its key because of the wrong index lookup. I also tried an alternative fix - changing both implementations to return minimum_token (and not 0) for the empty key. But this is undesirable - minimum_token is not supposed to be a valid token, so the tokenizer and sharder may not return a valid replica or shard for it, so we shouldn't store data under such token. We also have have code (such as an increasing- key sanity check in the flat mutation reader) which assumes that no real key in the data can be minimum_token, and our plan is to start allowing data with an empty key (at least for materialized views). This patch does not risk a backward-incompatible disk format changes for two reasons: 1. In the current Scylla, there was no valid case where an empty partition key may appear. CQL and Thrift forbid such keys, and materialized-views and indexes also (incorrectly - see #9364, #9375) drop such rows. 2. Although Cassandra does allow empty partition keys, they is only allowed in materialized views and indexes - and we don't support reading materialized views generated by Cassandra (the user must re-generate them in Scylla). When #9364 and #9375 will be fixed by the next patch, empty partition keys will start appearing in Scylla (in materialized views and in the materialized view backing a secondary index), and this fix will become important. Fixes #9352 Refs #9364 Refs #9375 Signed-off-by: Nadav Har'El <nyh@scylladb.com> (cherry picked from commit `bc4d0fd5ad`)	2022-12-28 15:24:53 +02:00
Piotr Grabowski	25508705a8	type_json: fix wrong blob JSON validation Fixes wrong condition for validating whether a JSON string representing blob value is valid. Previously, strings such as "6" or "0392fa" would pass the validation, even though they are too short or don't start with "0x". Add those test cases to json_cql_query_test.cc. Fixes #10114 (cherry picked from commit `f8b67c9bd1`)	2022-12-28 15:17:31 +02:00
Botond Dénes	347da028e9	mutation_compactor: reset stop flag on page start When the mutation compactor has all the rows it needs for a page, it saves the decision to stop in a member flag: _stop. For single partition queries, the mutation compactor is kept alive across pages and so it has a method, start_new_page() to reset its state for the next page. This method didn't clear the _stop flag. This meant that the value set at the end of the previous could cause the new page and subsequently the entire query to be stopped prematurely. This can happen if the new page starts with a row that is covered by a higher level tombstone and is completely empty after compaction. Reset the _stop flag in start_new_page() to prevent this. This commit also adds a unit test which reproduces the bug. Fixes: #12361 Closes #12384 (cherry picked from commit `b0d95948e1`)	2022-12-25 09:45:50 +02:00
Yaron Kaikov	874fa15202	release: prepare for 5.0.8 scylla-5.0.8	2022-12-21 21:53:30 +02:00
Michał Chojnowski	99c03cb2af	sstables: index_reader: always evict the local cache gently Due to an oversight, the local index cache isn't evicted gently when _upper_bound existed. This is a source of reactor stalls. Fix that. Fixes #12271 Closes #12364 (cherry picked from commit `d9269abf5b`)	2022-12-21 13:43:26 +02:00
Botond Dénes	6c35d3c5cd	Merge 'Backport nodeops abort thread use-after-free patches' from Pavel Emelyanov This includes merges `396d9e6a46` and `2c021affd1` Things that got changed here: 1. All the node_ops_... stuff in storage_service was coroutinized after 5.0, so in this merge the changes were de-coroutinized back 2. Had to cherry-pick molding for UUID (`69fcc053bb` and `489e50ef3a`) 3. tracker::is_aborted() was added after 5.0, it caused minor context conflict 4. watchdog interval was changed, also caused minor context conflict refs: #10284 Closes #12335 * github.com:scylladb/scylladb: repair: use sharded abort_source to abort repair_info repair: node_ops_info: add start and stop methods storage_service: node_ops_abort_thread: abort all node ops on shutdown storage_service: node_ops_abort_thread: co_return only after printing log message storage_service: node_ops_meta_data: add start and stop methods repair: node_ops_info: prevent accidental copy repair: Remove ops_uuid repair: Remove abort_repair_node_ops() altogether repair: Subscribe on node_ops_info::as abortion repair: Keep abort source on node_ops_info repair: Pass node_ops_info arg to do_sync_data_using_repair() repair: Mark repair_info::abort() noexcept node_ops: Remove _aborted bit node_ops: Simplify construction of node_ops_metadata main: Fix message about repair service starting utils: uuid: make operator bool explicit utils: uuid: add null_uuid	2022-12-16 10:49:49 +02:00
Benny Halevy	707622ce15	repair: use sharded abort_source to abort repair_info Currently we use a single shared_ptr<abort_source> that can't be copied across shards. Instead, use a sharded<abort_source> in node_ops_info so that each repair_info instance will use an (optional) abort_source* on its own shard. Added respective start and stop methodsm plus a local_abort_source getter to get the shard-local abort_source (if available). Fixes #11826 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	bab36b604c	repair: node_ops_info: add start and stop methods Prepare for adding a sharded<abort_source> member. Wire start/stop in storage_service::node_ops_meta_data. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	8840711e79	storage_service: node_ops_abort_thread: abort all node ops on shutdown A later patch adds a sharded<abort_source> to node_ops_info. On shutdown, we must orderly stop it, so use node_ops_abort_thread shutdown path (where node_ops_singal_abort is called will a nullopt) to abort (and stop) all outstanding node_ops by passing a null_uuid to node_ops_abort, and let it iterate over all node ops to abort and stop them. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	af18bb3fe9	storage_service: node_ops_abort_thread: co_return only after printing log message Currently the function co_returns if (!uuid_opt) so the log info message indicating it's stopped is not printed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	6003cba7a8	storage_service: node_ops_meta_data: add start and stop methods Prepare for starting and stopping repair node_ops_info Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	e9afd076eb	repair: node_ops_info: prevent accidental copy Delete node_ops_info copy and move constructors before we add a sharded<abort_source> member for the per-shard repairs in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	c5f732d42a	repair: Remove ops_uuid It used to be used to abort repair_info by the corresponding node-ops uuid, but this code is no longer there, so it's good to drop the uuid as well Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	13a1408135	repair: Remove abort_repair_node_ops() altogether This code is dead after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	6685e00dd4	repair: Subscribe on node_ops_info::as abortion When node_ops_meta_data aborts it also kicks repair to find and abort all relevant repair_infos. Now it can be simplified by subscribing repair_meta on the abort source and aborting it without explicit kick Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	350bb57291	repair: Keep abort source on node_ops_info Next patches will need to subscribe on node_ops_meta_data's abort source inside repair code, so keep the pointer on node_ops_info too. At the same time, the node_ops_info::abort becomes obsolete, because the same check can be performed via the abort_source->abort_requested() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	e186ad5b6c	repair: Pass node_ops_info arg to do_sync_data_using_repair() Next patches will need to know more than the ops_uuid. The needed info is (well -- will be) sitting on node_ops_info, so pass it along Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	139e9afc89	repair: Mark repair_info::abort() noexcept Next patch will call it inside abort_source subscription callback which requires the calling code to be noexcept Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	a42c6f190c	node_ops: Remove _aborted bit A short cleanup "while at it" -- the node_ops_meta_data doesn't need to carry dedicated _aborted boolean -- the abort source that sets it is available instantly Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	2b8f0cbd97	node_ops: Simplify construction of node_ops_metadata It always constructs node_ops_info the same way Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Pavel Emelyanov	a2a762e18d	main: Fix message about repair service starting Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	aa973e2b9e	utils: uuid: make operator bool explicit Following up on `69fcc053bb` To prevent unintentional implicit conversions e.g. to a number. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220216081623.830627-1-bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	e0777f1112	utils: uuid: add null_uuid and respective bool predecate and operator and unit test. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220215113438.473400-1-bhalevy@scylladb.com>	2022-12-15 18:48:45 +03:00
Benny Halevy	cc6311cbc7	view: row_lock: lock_ck: serialize partition and row locking The problematic scenario this patch fixes might happen due to unfortunate serialization of locks/unlocks between lock_pk and lock_ck, as follows: 1. lock_pk acquires an exclusive lock on the partition. 2.a lock_ck attempts to acquire shared lock on the partition and any lock on the row. both cases currently use a fiber returning a future<rwlock::holder>. 2.b since the partition is locked, the lock_partition times out returning an exceptional future. lock_row has no such problem and succeeds, returning a future holding a rwlock::holder, pointing to the row lock. 3.a the lock_holder previously returned by lock_pk is destroyed, calling `row_locker::unlock` 3.b row_locker::unlock sees that the partition is not locked and erases it, including the row locks it contains. 4.a when_all_succeeds continuation in lock_ck runs. Since the lock_partition future failed, it destroyes both futures. 4.b the lock_row future is destroyed with the rwlock::holder value. 4.c ~holder attempts to return the semaphore units to the row rwlock, but the latter was already destroyed in 3.b above. Acquiring the partition lock and row lock in parallel doesn't help anything, but it complicates error handling as seen above, This patch serializes acquiring the row lock in lock_ck after locking the partition to prevent the above race. This way, erasing the unlocked partition is never expected to happen while any of its rows locks is held. Fixes #12168 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #12208 (cherry picked from commit `5007ded2c1`)	2022-12-13 14:52:01 +02:00
Anna Mikhlin	0354e13718	release: prepare for 5.0.7 scylla-5.0.7	2022-12-07 14:57:09 +02:00
Nadav Har'El	2750d2e94b	Merge 'alternator: fix wrong 'where' condition for GSI range key' from Marcin Maliszkiewicz Contains fixes requested in the issue (and some tiny extras), together with analysis why they don't affect the users (see commit messages). Fixes [ #11800](https://github.com/scylladb/scylladb/issues/11800) Closes #11926 * github.com:scylladb/scylladb: alternator: add maybe_quote to secondary indexes 'where' condition test/alternator: correct xfail reason for test_gsi_backfill_empty_string test/alternator: correct indentation in test_lsi_describe alternator: fix wrong 'where' condition for GSI range key (cherry picked from commit `ce7c1a6c52`)	2022-12-05 20:53:19 +02:00
Benny Halevy	b4383a389b	repair_reader: construct _reader_handle before _reader Currently, the `_reader` member is explicitly initialized with the result of the call to `make_reader`. And `make_reader`, as a side effect, assigns a value to the `_reader_handle` member. Since C++ initializes class members sequentially, in the order they are defined, the assignment to `_reader_handle` in `make_reader()` happens before `_reader_handle` is initialized. This patch fixes that by changing the definition order, and consequently, the member initialization order in the constructor so that `_reader_handle` will be (default-)initialized before the call to `make_reader()`, avoiding the undefined behavior. Fixes #10882 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10883 (cherry picked from commit `9c231ad0ce`)	2022-12-05 20:33:58 +02:00
Nadav Har'El	f667c5923a	materialized views: fix view writes after base table schema change When we write to a materialized view, we need to know some information defined in the base table such as the columns in its schema. We have a "view_info" object that tracks each view and its base. This view_info object has a couple of mutable attributes which are used to lazily-calculate and cache the SELECT statement needed to read from the base table. If the base-table schema ever changes - and the code calls set_base_info() at that point - we need to forget this cached statement. If we don't (as before this patch), the SELECT will use the wrong schema and writes will no longer work. This patch also includes a reproducing test that failed before this patch, and passes afterwords. The test creates a base table with a view that has a non-trivial SELECT (it has a filter on one of the base-regular columns), makes a benign modification to the base table (just a silly addition of a comment), and then tries to write to the view - and before this patch it fails. Fixes #10026 Fixes #11542 (cherry picked from commit `2f2f01b045`)	2022-12-05 20:09:36 +02:00
Botond Dénes	e4ba0c56df	db/view/view_builder: don't drop partition and range tombstones when resuming The view builder builds the views from a given base table in view_builder::batch_size batches of rows. After processing this many rows, it suspends so the view builder can switch to building views for other base tables in the name of fairness. When resuming the build step for a given base table, it reuses the reader used previously (also serving the role of a snapshot, pinning sstables read from). The compactor however is created anew. As the reader can be in the middle of a partition, the view builder injects a partition start into the compactor to prime it for continuing the partition. This however only included the partition-key, crucially missing any active tombstones: partition tombstone or -- since the v2 transition -- active range tombstone. This can result in base rows covered by either of this to be resurrected and the view builder to generate view updates for them. This patch solves this by using the detach-state mechanism of the compactor which was explicitly developed for situations like this (in the range scan code) -- resuming a read with the readers kept but the compactor recreated. Also included are two test cases reproducing the problem, one with a range tombstone, the other with a partition tombstone. Fixes: #11668 Closes #11671 (cherry picked from commit `5621cdd7f9`)	2022-12-05 15:01:21 +02:00
Benny Halevy	329d55cc4f	configure: add --perf-tests-debuginfo option Provides separate control over debuginfo for perf tests since enabling --tests-debuginfo affects both today causing the Jenkins archives of perf tests binaries to inflate considerably. Refs https://github.com/scylladb/scylla-pkg/issues/3060 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `48021f3ceb`) Fixes #12191	2022-12-04 17:20:33 +02:00
Petr Gusev	b956293f47	modification_statement: fix LWT insert crash if clustering key is null PR #9314 fixed a similar issue with regular insert statements but missed the LWT code path. It's expected behaviour of modification_statement::create_clustering_ranges to return an empty range in this case, since possible_lhs_values it uses explicitly returns empty_value_set if it evaluates rhs to null, and it has a comment about it (All NULL comparisons fail; no column values match.) On the other hand, all components of the primary key are required to be set, this is checked at the prepare phase, in modification_statement::process_where_clause. So the only problem was modification_statement::execute_with_condition was not expecting an empty clustering_range in case of a null clustering key. Fixes: #11954 (cherry picked from commit `0d443dfd16`)	2022-12-04 15:00:27 +02:00
Nadav Har'El	6a8c2d3f56	Merge 'cql3: don't ignore other restrictions when a multi column restriction is present during filtering' from Jan Ciołek When filtering with multi column restriction present all other restrictions were ignored. So a query like: `SELECT * FROM WHERE pk = 0 AND (ck1, ck2) < (0, 0) AND regular_col = 0 ALLOW FILTERING;` would ignore the restriction `regular_col = 0`. This was caused by a bug in the filtering code: `2779a171fc/cql3/selection/selection.cc (L433-L449)` When multi column restrictions were detected, the code checked if they are satisfied and returned immediately. This is fixed by returning only when these restrictions are not satisfied. When they are satisfied the other restrictions are checked as well to ensure all of them are satisfied. This code was introduced back in 2019, when fixing #3574. Perhaps back then it was impossible to mix multi column and regular columns and this approach was correct. Fixes: #6200 Fixes: #12014 Closes #12031 * github.com:scylladb/scylladb: cql-pytest: add a reproducer for #12014, verify that filtering multi column and regular restrictions works boost/restrictions-test: uncomment part of the test that passes now cql-pytest: enable test for filtering combined multi column and regular column restrictions cql3: don't ignore other restrictions when a multi column restriction is present during filtering (cherry picked from commit `2d2034ea28`) Closes #12086	2022-11-26 14:24:08 +02:00
Piotr Grabowski	27a35c7f98	Udpate tools/jmx submodule (jackson dependency update) * tools/jmx 53f7f55...fe351e8 (1): > Update jackson dependency (cherry picked from commit `41b098f54e`) Refs #11929 Closes #11931	2022-11-20 20:10:14 +02:00
Pavel Emelyanov	d83134a245	Merge '[branch-5.0] multishard_mutation_query: don't unpop partition header of spent partition' from Botond Dénes When stopping the read, the multishard reader will dismantle the compaction state, pushing back (unpopping) the currently processed partition's header to its originating reader. This ensures that if the reader stops in the middle of a partition, on the next page the partition-header is re-emitted as the compactor (and everything downstream from it) expects. It can happen however that there is nothing more for the current partition in the reader and the next fragment is another partition. Since we only push back the partition header (without a partition-end) this can result in two partitions being emitted without being separated by a partition end. We could just add the missing partition-end when needed but it is pointless, if the partition has no more data, just drop the header, we won't need it on the next page. The missing partition-end can generate an "IDL frame truncated" message as it ends up causing the query result writer to create a corrupt partition entry. Fixes: https://github.com/scylladb/scylladb/issues/9482 Closes #11912 * github.com:scylladb/scylladb: test/cql-pytest: add regression test for "IDL frame truncated" error mutation_compactor: detach_state(): make it no-op if partition was exhausted	2022-11-16 11:50:50 +03:00
Anna Mikhlin	b844d14829	release: prepare for 5.0.6 scylla-5.0.6	2022-11-13 16:39:30 +02:00
Eliran Sinvani	184df0393e	cql: Fix crash upon use of the word empty for service level name Wrong access to an uninitialized token instead of the actual generated string caused the parser to crash, this wasn't detected by the ANTLR3 compiler because all the temporary variables defined in the ANTLR3 statements are global in the generated code. This essentialy caused a null dereference. Tests: 1. The fixed issue scenario from github. 2. Unit tests in release mode. Fixes #11774 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <20190612133151.20609-1-eliransin@scylladb.com> Closes #11777 (cherry picked from commit `ab7429b77d`)	2022-11-10 20:43:21 +02:00
Nadav Har'El	1b550dd301	cql3: fix cql3::util::maybe_quote() for keywords cql3::util::maybe_quote() is a utility function formatting an identifier name (table name, column name, etc.) that needs to be embedded in a CQL statement - and might require quoting if it contains non-alphanumeric characters, uppercase characters, or a CQL keyword. maybe_quote() made an effort to only quote the identifier name if neccessary, e.g., a lowercase name usually does not need quoting. But lowercase names that are CQL keywords - e.g., to or where - cannot be used as identifiers without quoting. This can cause problems for code that wants to generate CQL statements, such as the materialized-view problem in issue #9450 - where a user had a column called "to" and wanted to create a materialized view for it. So in this patch we fix maybe_quote() to recognize invalid identifiers by using the CQL parser, and quote them. This will quote reserved keywords, but not so-called unreserved keywords, which are allowed as identifiers and don't need quoting. This addition slows down maybe_quote(), but maybe_quote() is anyway only used in heavy operations which need to generate CQL. This patch also adds two tests that reproduce the bug and verify its fix: 1. Add to the low-level maybe_quote() test (a C++ unit test) also tests that maybe_quote() quotes reserved keywords like "to", but doesn't quote unreserved keywords like "int". 2. Add a test reproducing issue #9450 - creating a materialized view whose key column is a keyword. This new test passes on Cassandra, failed on Scylla before this patch, and passes after this patch. It is worth noting that maybe_quote() now has a "forward compatiblity" problem: If we save CQL statements generated by maybe_quote(), and a future version introduces a new reserved keyword, the parser of the future version may not be able to parse the saved CQL statement that was generated with the old mayb_quote() and didn't quote what is now a keyword. This problem can be solved in two ways: 1. Try hard not to introduced new reserved keywords. Instead, introduce unreserved keywords. We've been doing this even before recognizing this maybe_quote() future-compatibility problem. 2. In the next patch we will introduce quote() - which unconditionally quotes identifier names, even if lowercase. These quoted names will be uglier for lowercase names - but will be safe from future introduction of new keywords. So we can consider switching some or all uses of maybe_quote() to quote(). Fixes #9450 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220118161217.231811-1-nyh@scylladb.com> (cherry picked from commit `5d2f694a90`)	2022-11-07 17:01:32 +02:00
Alexander Turetskiy	01ce53d7fb	Alternator: Projection field added to return from DescribeTable which describes GSIs and LSIs. The return from DescribeTable which describes GSIs and LSIs is missing the Projection field. We do not yet support all the settings Projection (see #5036), but the default which we support is ALL, and DescribeTable should return that in its description. Fixes #11470 Closes #11693 (cherry picked from commit `636e14cc77`)	2022-11-07 17:01:32 +02:00

1 2 3 4 5 ...

30158 Commits