scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-25 19:10:42 +00:00

Author	SHA1	Message	Date
Jan Ciolek	3f133cfa87	cql-pytest/test_unset: test unset value in UPDATEs with IF EXISTS Test what happens when an UNSET_VALUE is passed to an UPDATE statement with IF EXISTS condition. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-03-09 16:18:34 +01:00
Jan Ciolek	d66e23b265	cql-pytest/test_unset: test unset value in UPDATE statements Test what happens when an UNSET_VALUE is passed to an UPDATE statement. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-03-09 16:18:34 +01:00
Jan Ciolek	378e8761b9	cql-pytest/test_unset: test unset value in INSERTs with IF NOT EXISTS Add tests which test INSERT statements with IF NOT EXISTS, when an UNSET_VLAUE is passed for some column. The test are similar to the previous ones done for simple INSERTs without IF NOT EXISTS. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-03-09 16:18:34 +01:00
Jan Ciolek	fc26f6b850	cql-pytest/test_unset: test unset value in INSERT statements Add some tests which test what happens when an UNSET_VALUE is passed to an INSERT statement. Passing it for partition key column is impossible because python driver doesn't allow it. Passing it for clustering key column causes Scylla to silently ignore the INSERT. Passing it for a regular or static column causes this column to remain unchanged, as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-03-09 16:18:33 +01:00
Jan Ciolek	7663dc31b8	cas_request: fix crash on unset value in primary key with LWT Doing an LWT INSERT/UPDATE and passing UNSET_VALUE for the primary key column used to caused a crash. This is a minimal fix for this crash. Crash backtrace pointed to a place where we tried doing .front() on an empty vector of primary key ranges. I added a check that the vector isn't empty. If it's empty then let's throw an error and mention that it's most likely caused by an unset value. This has been fixed on master, but the PR that fixed it introduced breaking changes, which I don't want to add to branch-5.1. This fix is absolutely minimal - it performs the check at the last moment before a crash. It's not the prettiest, but it works and can't introduce breaking changes, because the new code gets activated only in cases that would've caused a crash. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-03-09 16:12:18 +01:00
Anna Stuchlik	2152b765a6	doc: Update the documentation landing page This commit makes the following changes to the docs landing page: - Adds the ScyllaDB enterprise docs as one of three tiles. - Modifies the three tiles to reflect the three flavors of ScyllaDB. - Moves the "New to ScyllaDB? Start here!" under the page title. - Renames "Our Products" to "Other Products" to list the products other than ScyllaDB itself. In addtition, the boxes are enlarged from to large-4 to look better. The major purpose of this commit is to expose the ScyllaDB documentation. docs: fix the link (cherry picked from commit `27bb8c2302`) Closes #13086	2023-03-06 14:19:30 +02:00
Pavel Emelyanov	d43b6db152	azure_snitch: Handle empty zone returned from IMDS Azure metadata API may return empty zone sometimes. If that happens shard-0 gets empty string as its rack, but propagates UNKNOWN_RACK to other shards. Empty zones response should be handled regardless. refs: #12185 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #12274 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-02 09:16:02 +03:00
Pavel Emelyanov	a514e60e65	snitch: Check http response codes to be OK Several snitch drivers make http requests to get region/dc/zone/rack/whatever from the cloud provider. They blindly rely on the response being successfull and read response body to parse the data they need from. That's not nice, add checks for requests finish with http OK statuses. refs: #12185 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #12287 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-02 09:16:02 +03:00
Anna Stuchlik	6eb70caba9	doc: fixes https://github.com/scylladb/scylladb/issues/12954 , adds the minimal version from which the 2021.1-to-2022.1 upgrade is supported for Ubuntu, Debian, and image Closes #12974 (cherry picked from commit `91b611209f`)	2023-02-28 13:03:05 +02:00
Botond Dénes	dd094f1230	types: unserialize_value for multiprecision_int,bool: don't read uninitialized memory Check the first fragment before dereferencing it, the fragment might be empty, in which case move to the next one. Found by running range scan tests with random schema and random data. Fixes: #12821 Fixes: #12823 Fixes: #12708 Closes #12824 (cherry picked from commit `ef548e654d`)	2023-02-23 22:38:24 +02:00
Yaron Kaikov	530600a646	release: prepare for 5.1.6 scylla-5.1.6	2023-02-23 14:28:29 +02:00
Kefu Chai	c8e5f8c66b	tools/schema_loader: do not return ref to a local variable we should never return a reference to local variable. so in this change, a reference to a static variable is returned instead. this should address following warning from Clang 17: ``` /home/kefu/dev/scylladb/tools/schema_loader.cc:146:16: error: returning reference to local temporary object [-Werror,-Wreturn-stack-address] return {}; ^~ ``` Fixes #12875 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #12876 (cherry picked from commit `6eab8720c4`)	2023-02-22 22:02:51 +02:00
Gleb Natapov' via ScyllaDB development	ca7a13cad2	lwt: upgrade stored mutations to the latest schema during prepare Currently they are upgraded during learn on a replica. The are two problems with this. First the column mapping may not exist on a replica if it missed this particular schema (because it was down for instance) and the mapping history is not part of the schema. In this case "Failed to look up column mapping for schema version" will be thrown. Second lwt request coordinator may not have the schema for the mutation as well (because it was freed from the registry already) and when a replica tries to retrieve the schema from the coordinator the retrieval will fail causing the whole request to fail with "Schema version XXXX not found" Both of those problems can be fixed by upgrading stored mutations during prepare on a node it is stored at. To upgrade the mutation its column mapping is needed and it is guarantied that it will be present at the node the mutation is stored at since it is pre-request to store it that the corresponded schema is available. After that the mutation is processed using latest schema that will be available on all nodes. Fixes #10770 Message-Id: <Y7/ifraPJghCWTsq@scylladb.com> (cherry picked from commit `15ebd59071`)	2023-02-22 21:58:30 +02:00
Tomasz Grabiec	d401551020	db: Fix trim_clustering_row_ranges_to() for non-full keys and reverse order trim_clustering_row_ranges_to() is broken for non-full keys in reverse mode. It will trim the range to position_in_partition_view::after_key(full_key) instead of position_in_partition_view::before_key(key), hence it will include the key in the resulting range rather than exclude it. Fixes #12180 Refs #1446 (cherry picked from commit `536c0ab194`)	2023-02-22 21:52:46 +02:00
Tomasz Grabiec	d952bf4035	types: Fix comparison of frozen sets with empty values A frozen set can be part of the clustering key, and with compact storage, the corresponding key component can have an empty value. Comparison was not prepared for this, the iterator attempts to deserialize the item count and will fail if the value is empty. Fixes #12242 (cherry picked from commit `232ce699ab`)	2023-02-22 21:44:37 +02:00
Michał Chojnowski	b346136e98	utils: config_file: fix handling of workdir,W in the YAML file Option names given in db/config.cc are handled for the command line by passing them to boost::program_options, and by YAML by comparing them with YAML keys. boost::program_options has logic for understanding the long_name,short_name syntax, so for a "workdir,W" option both --workdir and -W worked, as intended. But our YAML config parsing doesn't have this logic and expected "workdir,W" verbatim, which is obviously not intended. Fix that. Fixes #7478 Fixes #9500 Fixes #11503 Closes #11506 (cherry picked from commit `af7ace3926`)	2023-02-22 21:33:04 +02:00
Takuya ASADA	87e267213d	scylla_coredump_setup: fix coredump timeout settings We currently configure only TimeoutStartSec, but probably it's not enough to prevent coredump timeout, since TimeoutStartSec is maximum waiting time for service startup, and there is another directive to specify maximum service running time (RuntimeMaxSec). To fix the problem, we should specify RunTimeMaxSec and TimeoutSec (it configures both TimeoutStartSec and TimeoutStopSec). Fixes #5430 Closes #12757 (cherry picked from commit `bf27fdeaa2`)	2023-02-19 21:13:59 +02:00
Botond Dénes	5bda9356d5	Merge 'doc: fix the service name from "scylla-enterprise-server" "to "scylla-server"' from Anna Stuchlik Related https://github.com/scylladb/scylladb/issues/12658. This issue fixes the bug in the upgrade guides for the released versions. Closes #12679 * github.com:scylladb/scylladb: doc: fix the service name in the upgrade guide for patch releases versions 2022 doc: fix the service name in the upgrade guide from 2021.1 to 2022.1 (cherry picked from commit `325246ab2a`)	2023-02-17 12:20:26 +02:00
Botond Dénes	f9fe48ad89	Merge 'Backport compaction-backlog-tracker fixes to branch-5.1' from Raphael "Raph" Carvalho Both patches are important to fix inefficiencies when updating the backlog tracker, which can manifest as a reactor stall, on a special event like schema change. A simple conflict was resolved in the first patch, since master has compaction groups. It was very easy to resolve. Regression since `1d9f53c881`, which is present in 5.1 onwards. So probably it merits a backport to 5.2 too. Closes #12769 * github.com:scylladb/scylladb: compaction: Fix inefficiency when updating LCS backlog tracker table: Fix quadratic behavior when inserting sstables into tracker on schema change	2023-02-15 07:26:00 +02:00
Raphael S. Carvalho	0c9a0faf0d	compaction: Fix inefficiency when updating LCS backlog tracker LCS backlog tracker uses STCS tracker for L0. Turns out LCS tracker is calling STCS tracker's replace_sstables() with empty arguments even when higher levels (> 0) only had sstables replaced. This unnecessary call to STCS tracker will cause it to recompute the L0 backlog, yielding the same value as before. As LCS has a fragment size of 0.16G on higher levels, we may be updating the tracker multiple times during incremental compaction, which operates on SSTables on higher levels. Inefficiency is fixed by only updating the STCS tracker if any L0 sstable is being added or removed from the table. This may be fixing a quadratic behavior during boot or refresh, as new sstables are loaded one by one. Higher levels have a substantial higher number of sstables, therefore updating STCS tracker only when level 0 changes, reduces significantly the number of times L0 backlog is recomputed. Refs #12499. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12676 (cherry picked from commit `1b2140e416`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-02-07 12:45:34 -03:00
Raphael S. Carvalho	47dcfd866c	table: Fix quadratic behavior when inserting sstables into tracker on schema change Each time backlog tracker is informed about a new or old sstable, it will recompute the static part of backlog which complexity is proportional to the total number of sstables. On schema change, we're calling backlog_tracker::replace_sstables() for each existing sstable, therefore it produces O(N ^ 2) complexity. Fixes #12499. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12593 (cherry picked from commit `87ee547120`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-02-07 12:43:30 -03:00
Beni Peled	5c9ecd5604	release: prepare for 5.1.5 scylla-5.1.5	2023-02-06 14:47:49 +02:00
Anna Stuchlik	1a37b85d14	docs: fix the option name from compaction to compression on the Data Definition page Fixes the option name in the "Other table options" table on the Data Definition page. Fixes #12334 Closes #12382 (cherry picked from commit `ea7e23bf92`)	2023-02-05 20:06:31 +02:00
Botond Dénes	5070ddb723	sstables: track decompressed buffers Convert decompressed temporary buffers into tracked buffers just before returning them to the upper layer. This ensures these buffers are known to the reader concurrency semaphore and it has an accurate view of the actual memory consumption of reads. Fixes: #12448 Closes #12454 (cherry picked from commit `c4688563e3`)	2023-02-05 20:06:31 +02:00
Tomasz Grabiec	7480af58e5	row_cache: Fix violation of the "oldest version are evicted first" when evicting last dummy Consider the following MVCC state of a partition: v2: ==== <7> [entry2] ==== <9> ===== <last dummy> v1: ================================ <last dummy> [entry1] Where === means a continuous range and --- means a discontinuous range. After two LRU items are evicted (entry1 and entry2), we will end up with: v2: ---------------------- <9> ===== <last dummy> v1: ================================ <last dummy> [entry1] This will cause readers to incorrectly think there are no rows before entry <9>, because the range is continuous in v1, and continuity of a snapshot is a union of continuous intervals in all versions. The cursor will see the interval before <9> as continuous and the reader will produce no rows. This is only temporary, because current MVCC merging rules are such that the flag on the latest entry wins, so we'll end up with this once v1 is no longer needed: v2: ---------------------- <9> ===== <last dummy> ...and the reader will go to sstables to fetch the evicted rows before entry <9>, as expected. The bug is in rows_entry::on_evicted(), which treats the last dummy entry in a special way, and doesn't evict it, and doesn't clear the continuity by omission. The situation is not easy to trigger because it requires certain eviction pattern concurrent with multiple reads of the same partition in different versions, so across memtable flushes. Closes #12452 (cherry-picked from commit `f97268d8f2`) Fixes #12451.	2023-02-05 20:06:31 +02:00
Raphael S. Carvalho	43d46a241f	compaction: LCS: don't reshape all levels if only a single breaks disjointness LCS reshape is compacting all levels if a single one breaks disjointness. That's unnecessary work because rewriting that single level is enough to restore disjointness. If multiple levels break disjointness, they'll each be reshaped in its own iteration, so reducing operation time for each step and disk space requirement, as input files can be released incrementally. Incremental compaction is not applied to reshape yet, so we need to avoid "major compaction", to avoid the space overhead. But space overhead is not the only problem, the inefficiency, when deciding what to reshape when overlapping is detected, motivated this patch. Fixes #12495. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12496 (cherry picked from commit `f2f839b9cc`)	2023-02-05 20:06:31 +02:00
Wojciech Mitros	3807020a7b	forward_service: prevent heap use-after-free of forward_aggregates Currently, we create `forward_aggregates` inside a function that returns the result of a future lambda that captures these aggregates by reference. As a result, the aggregates may be destructed before the lambda finishes, resulting in a heap use-after-free. To prolong the lifetime of these aggregates, we cannot use a move capture, because the lambda is wrapped in a with_thread_if_needed() call on these aggregates. Instead, we fix this by wrapping the entire return statement in a do_with(). Fixes #12528 Closes #12533 (cherry picked from commit `5f45b32bfa`)	2023-02-05 20:06:31 +02:00
Botond Dénes	d6fb20f30e	types: is_tuple(): handle reverse types Currently reverse types match the default case (false), even though they might be wrapping a tuple type. One user-visible effect of this is that a schema, which has a reversed<frozen<UDT>> clustering key component, will have this component incorrectly represented in the schema cql dump: the UDT will loose the frozen attribute. When attempting to recreate this schema based on the dump, it will fail as the only frozen UDTs are allowed in primary key components. Fixes: #12576 Closes #12579 (cherry picked from commit `ebc100f74f`)	2023-02-05 20:06:31 +02:00
Calle Wilund	33c20eebe6	alterator::streams: Sort tables in list_streams to ensure no duplicates Fixes #12601 (maybe?) Sort the set of tables on ID. This should ensure we never generate duplicates in a paged listing here. Can obviously miss things if they are added between paged calls and end up with a "smaller" UUID/ARN, but that is to be expected. (cherry picked from commit `da8adb4d26`)	2023-02-05 20:06:28 +02:00
Benny Halevy	f3a6af663d	view: row_lock: lock_ck: find or construct row_lock under partition lock Since we're potentially searching the row_lock in parallel to acquiring the read_lock on the partition, we're racing with row_locker::unlock that may erase the _row_locks entry for the same clustering key, since there is no lock to protect it up until the partition lock has been acquired and the lock_partition future is resolved. This change moves the code to search for or allocate the row lock _after_ the partition lock has been acquired to make sure we're synchronously starting the read/write lock function on it, without yielding, to prevent this use-after-free. This adds an allocation for copying the clustering key in advance even if a row_lock entry already exists, that wasn't needed before. It only us slows down (a bit) when there is contention and the lock already existed when we want to go locking. In the fast path there is no contention and then the code already had to create the lock and copy the key. In any case, the penalty of copying the key once is tiny compared to the rest of the work that view updates are doing. This is required on top of `5007ded2c1` as seen in https://github.com/scylladb/scylladb/issues/12632 which is closely related to #12168 but demonstrates a different race causing use-after-free. Fixes #12632 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `4b5e324ecb`)	2023-02-05 17:38:29 +02:00
Anna Stuchlik	1685af9829	docs: fixes https://github.com/scylladb/scylladb/issues/12654 , update the links to the Download Center Closes #12655 (cherry picked from commit `64cc4c8515`)	2023-02-05 17:20:45 +02:00
Anna Stuchlik	bb880c7658	doc: fixes https://github.com/scylladb/scylladb/issues/12672 , fix the redirects to the Cloud docs Closes #12673 (cherry picked from commit `2be131da83`)	2023-02-05 17:17:46 +02:00
Kefu Chai	f952d397e8	cql3/selection: construct string_view using char* not size before this change, we construct a sstring from a comma statement, which evaluates to the return value of `name.size()`, but what we expect is `sstring(const char, size_t)`. in this change instead of passing the size of the string_view, both its address and size are used * `std::string_view` is constructed instead of sstring, for better performance, as we don't need to perform a deep copy the issue is reported by GCC-13: ``` In file included from cql3/selection/selectable.cc:11: cql3/selection/field_selector.hh:83:60: error: ignoring return value of function declared with 'nodiscard' attribute [-Werror,-Wunused-result] auto sname = sstring(reinterpret_cast<const char*>(name.begin(), name.size())); ^~~~~~~~~~ ``` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #12666 (cherry picked from commit `186ceea009`) Fixes #12739. (cherry picked from commit `b588b19620`)	2023-02-05 13:51:22 +02:00
Michał Chojnowski	5e88421360	commitlog: fix total_size_on_disk accounting after segment file removal Currently, segment file removal first calls `f.remove_file()` and does `total_size_on_disk -= f.known_size()` later. However, `remove_file()` resets `known_size` to 0, so in effect the freed space in not accounted for. `total_size_on_disk` is not just a metric. It is also responsible for deciding whether a segment should be recycled -- it is recycled only if `total_size_on_disk - known_size < max_disk_size`. Therefore this bug has dire performance consequences: if `total_size_on_disk - known_size` ever exceeds `max_disk_size`, the recycling of commitlog segments will stop permanently, because `total_size_on_disk - known_size` will never go back below `max_disk_size` due to the accounting bug. All new segments from this point will be allocated from scratch. The bug was uncovered by a QA performance test. It isn't easy to trigger -- it took the test 7 hours of constant high load to step into it. However, the fact that the effect is permanent, and degrades the performance of the cluster silently, makes the bug potentially quite severe. The bug can be easily spotted with Prometheus as infinitely rising `commitlog_total_size_on_disk` on the affected shards. Fixes #12645 Closes #12646 (cherry picked from commit `fa7e904cd6`)	2023-02-01 21:54:52 +02:00
Kamil Braun	1945102ca0	docs: fix problems with Raft documentation Fix some problems in the documentation, e.g. it is not possible to enable Raft in an existing cluster in 5.0, but the documentation claimed that it is. (cherry picked from commit `1cc68b262e`) Cherry-pick note: the original commit added a lot of new stuff like describing the Raft upgrade procedure, but also fixed problems with the existing documentation. In this backport we include only the latter. Closes #12582	2023-01-24 13:35:24 +02:00
Anna Mikhlin	be3f6f8c7b	release: prepare for 5.1.4 scylla-5.1.4	2023-01-22 15:29:48 +02:00
Nadav Har'El	94735f63a3	Merge 'doc: add the upgrade guide for ScyllaDB 5.1 to ScyllaDB Enterprise 2022.2' from Anna Stuchlik Fix https://github.com/scylladb/scylladb/issues/12315 This PR adds the upgrade guide from ScyllaDB 5.1 to ScyllaDB Enterprise 2022.2. Instead of adding separate guides per platform, I've merged the information to create one platform-agnostic guide, similar to what we did for [OSS->OSS](https://docs.scylladb.com/stable/upgrade/upgrade-opensource/upgrade-guide-from-5.0-to-5.1/) and [Enterprise->Enterprise ](https://github.com/scylladb/scylladb/pull/12339)guides. Closes #12450 * github.com:scylladb/scylladb: doc: add the new upgrade guide to the toctree and fix its name docs: add the upgrade guide from ScyllaDB 5.1 to ScyllaDB Enterprise 2022.2 (cherry picked from commit `7192283172`)	2023-01-20 14:29:59 +01:00
Michał Sala	b0d28919c0	forward_service: fix timeout support in parallel aggregates `forward_request` verb carried information about timeouts using `lowres_clock::time_point` (that came from local steady clock `seastar::lowres_clock`). The time point was produced on one node and later compared against other node `lowres_clock`. That behavior was wrong (`lowres_clock::time_point`s produced with different `lowres_clock`s cannot be compared) and could lead to delayed or premature timeout. To fix this issue, `lowres_clock::time_point` was replaced with `lowres_system_clock::time_point` in `forward_request` verb. Representation to which both time point types serialize is the same (64-bit integer denoting the count of elapsed nanoseconds), so it was possible to do an in-place switch of those types using logic suggested by @avikivity: - using steady_clock is just broken, so we aren't taking anything from users by breaking it further - once all nodes are upgraded, it magically starts to work Closes #12529 (cherry picked from commit `bbbe12af43`) Fixes #12458	2023-01-18 14:09:37 +02:00
Anna Mikhlin	addc4666d5	release: prepare for 5.1.3 scylla-5.1.3	2023-01-12 15:51:01 +02:00
Botond Dénes	a14ffbd5e2	Merge 'Backport 5.1 cleanup compaction flush memtable' from Benny Halevy This a backport of `9fa1783892` (#11902) to branch-5.1 Flush the memtable before cleaning up the table so not to leave any disowned tokens in the memtable as they might be resurrected if left in the memtable. Refs #1239 Closes #12490 * github.com:scylladb/scylladb: table: perform_cleanup_compaction: flush memtable table: add perform_cleanup_compaction api: storage_service: add logging for compaction operations et al	2023-01-11 08:03:35 +02:00
Benny Halevy	ea56ecace0	table: perform_cleanup_compaction: flush memtable We don't explicitly cleanup the memtable, while it might hold tokens disowned by the current node. Flush the memtable before performing cleanup compaction to make sure all tokens in the memtable are cleaned up. Note that non-owned ranges are invalidate in the cache in compaction_group::update_main_sstable_list_on_compaction_completion using desc.ranges_for_cache_invalidation. \Fixes #1239 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `eb3a94e2bc`)	2023-01-11 07:55:43 +02:00
Benny Halevy	fe8d8f97e2	table: add perform_cleanup_compaction Move the integration with compaction_manager from the api layer to the tabel class so it can also make sure the memtable is cleaned up in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `fc278be6c4`)	2023-01-11 07:51:29 +02:00
Benny Halevy	44e920cbb0	api: storage_service: add logging for compaction operations et al Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from `85523c45c0`)	2023-01-11 07:46:09 +02:00
Michał Chojnowski	b114551d53	configure: don't reduce parsers' optimization level to 1 in release The line modified in this patch was supposed to increase the optimization levels of parsers in debug mode to 1, because they were too slow otherwise. But as a side effect, it also reduced the optimization level in release mode to 1. This is not a problem for the CQL frontend, because statement preparation is not performance-sensitive, but it is a serious performance problem for Alternator, where it lies in the hot path. Fix this by only applying the -O1 to debug modes. Fixes #12463 Closes #12460 (cherry picked from commit `08b3a9c786`)	2023-01-08 01:34:56 +02:00
Nadav Har'El	099145fe9a	materialized view: fix bug in some large modifications to base partitions Sometimes a single modification to a base partition requires updates to a large number of view rows. A common example is deletion of a base partition containing many rows. A large BATCH is also possible. To avoid large allocations, we split the large amount of work into batch of 100 (max_rows_for_view_updates) rows each. The existing code assumed an empty result from one of these batches meant that we are done. But this assumption was incorrect: There are several cases when a base-table update may not need a view update to be generated (see can_skip_view_updates()) so if all 100 rows in a batch were skipped, the view update stopped prematurely. This patch includes two tests showing when this bug can happen - one test using a partition deletion with a USING TIMESTAMP causing the deletion to not affect the first 100 rows, and a second test using a specially-crafed large BATCH. These use cases are fairly esoteric, but in fact hit a user in the wild, which led to the discovery of this bug. The fix is fairly simple: To detect when build_some() is done it is no longer enough to check if it returned zero view-update rows; Rather, it explicitly returns whether or not it is done as an std::optional. The patch includes several tests for this bug, which pass on Cassandra, failed on Scylla before this patch, and pass with this patch. Fixes #12297. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12305 (cherry picked from commit `92d03be37b`)	2023-01-04 10:05:18 +02:00
Avi Kivity	0bdbce90f4	Merge 'reader_concurrency_semaphore: fix waiter/inactive race' from Botond Dénes We recently (in `7fbad8de87`) made sure all admission paths can trigger the eviction of inactive reads. As reader eviction happens in the background, a mechanism was added to make sure only a single eviction fiber was running at any given time. This mechanism however had a preemption point between stopping the fiber and releasing the evict lock. This gave an opportunity for either new waiters or inactive readers to be added, without the fiber acting on it. Since it still held onto the lock, it also prevented from other eviction fibers to start. This could create a situation where the semaphore could admit new reads by evicting inactive ones, but it still has waiters. Since an empty waitlist is also an admission criteria, once one waiter is wrongly added, many more can accumulate. This series fixes this by ensuring the lock is released in the instant the fiber decides there is no more work to do. It also fixes the assert failure on recursive eviction and adds a detection to the inactive/waiter contradiction. Fixes: #11923 Refs: #11770 Closes #12026 * github.com:scylladb/scylladb: reader_concurrency_semaphore: do_wait_admission(): detect admission-waiter anomaly reader_concurrency_semaphore: evict_readers_in_the_background(): eliminate blind spot reader_concurrency_semaphore: do_detach_inactive_read(): do a complete detach (cherry picked from commit `15ee8cfc05`)	2023-01-03 16:45:51 +02:00
Botond Dénes	8ffdb8546b	reader_concurrency_semaphore: unify admission logic across all paths The semaphore currently has two admission paths: the obtain_permit()/with_permit() methods which admits permits on user request (the front door) and the maybe_admit_waiters() which admits permits based on internal events like memory resource being returned (the back door). The two paths used their own admission conditions and naturally this means that they diverged in time. Notably, maybe_admit_waiters() did not look at inactive readers assuming that if there are waiters there cannot be inactive readers. This is not true however since we merged the execution-stage into the semaphore. Waiters can queue up even when there are inactive reads and thus maybe_admit_waiters() has to consider evicting some of them to see if this would allow for admitting new reads. To avoid such divergence in the future, the admission logic was moved into a new method can_admit_read() which is now shared between the two method families. This method now checks for the possibility of evicting inactive readers as well. The admission logic was tuned slightly to only consider evicting inactive readers if there is a real possibility that this will result in admissions: notably, before this patch, resource availability was checked before stalls were (used permits == blocked permits), so we could evict readers even if this couldn't help. Because now eviction can be started from maybe_admit_waiters(), which is also downstream from eviction, we added a flag to avoid recursive evict -> maybe admit -> evict ... loops. Fixes: #11770 Closes #11784 (cherry picked from commit `7fbad8de87`)	2023-01-03 16:45:17 +02:00
Takuya ASADA	db382697f1	scylla_setup: fix incorrect type definition on --online-discard option --online-discard option defined as string parameter since it doesn't specify "action=", but has default value in boolean (default=True). It breaks "provisioning in a similar environment" since the code supposed boolean value should be "action='store_true'" but it's not. We should change the type of the option to int, and also specify "choices=[0, 1]" just like --io-setup does. Fixes #11700 Closes #11831 (cherry picked from commit `acc408c976`)	2022-12-28 20:44:02 +02:00
Petr Gusev	3c02e5d263	cql: batch statement, inserting a row with a null key column should be forbidden Regular INSERT statements with null values for primary key components are rejected by Scylla since #9286 and #9314. Batch statements missed a similar check, this patch fixes it. Fixes: #12060 (cherry picked from commit `7730c4718e`)	2022-12-28 18:15:40 +02:00
Anna Mikhlin	4c0f7ea098	release: prepare for 5.1.2 scylla-5.1.2	2022-12-25 20:53:22 +02:00

1 2 3 4 5 ...

32556 Commits