scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-29 04:37:00 +00:00

Author	SHA1	Message	Date
Nadav Har'El	bf7fc9709d	test/rest_api: fix flaky test for toppartitions The REST test test_storage_service.py::test_toppartitions_pk_needs_escaping was flaky. It tests the toppartitions request, which unfortunately needs to choose a sampling duration in advance, and we chose 1 second which we considered more than enough - and indeed typically even 1ms is enough! But very rarely (we know of only one occurrence, in issue #13223) one second is not enough. Instead of increasing this 1 second and making this test even slower, this patch takes a retry approach: The test starts with a 0.01 second duration, and is then retried with increasing durations until it succeeds or a 5-second duration is reached. This retry approach has two benefits: 1. It de-flakes the test (allowing a very slow test to take 5 seconds instead of 1 second which wasn't enough), and 2. At the same time it makes a successful test much faster (it used to always take a full second, now it takes 0.07 seconds on a dev build on my laptop). A failed test may, in some cases, take 10 seconds after this patch (although in some other cases, an error will be caught immediately), but I consider this acceptable - this test should pass, after all, and a failure indicates a regression and taking 10 seconds will be the least of our worries in that case. Fixes #13223. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13238 (cherry picked from commit `c550e681d7`)	2023-04-27 19:16:58 +03:00
Nadav Har'El	00a8c3a433	test/alternator: increase CQL connection timeout This patch increases the connection timeout in the get_cql_cluster() function in test/cql-pytest/run.py. This function is used to test that Scylla came up, and also test/alternator/run uses it to set up the authentication - which can only be done through CQL. The Python driver has 2-second and 5-second default timeouts that should have been more than enough for everybody (TM), but in #13239 we saw that in one case it apparently wasn't enough. So to be extra safe, let's increase the default connection-related timeouts to 60 seconds. Note this change only affects the Scylla boot in the test/*/run scripts, and it does not affect the actual tests - those have different code to connect to Scylla (see cql_session() in test/cql-pytest/util.py), and we already increased the timeouts there in #11289. Fixes #13239 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13291 (cherry picked from commit `4fdcee8415`)	2023-04-27 19:15:39 +03:00
Tomasz Grabiec	c08ed39a33	direct_failure_detector: Avoid throwing exceptions in the success path sleep_abortable() is aborted on success, which causes sleep_aborted exception to be thrown. This causes scylla to throw every 100ms for each pinged node. Throwing may reduce performance if happens often. Also, it spams the logs if --logger-log-level exception=trace is enabled. Avoid by swallowing the exception on cancellation. Fixes #13278. Closes #13279 (cherry picked from commit `99cb948eac`)	2023-04-27 19:14:31 +03:00
Kefu Chai	04424f8956	test: cql-pytest: test_describe: clamp bloom filter's fp rate before this change, we use `round(random.random(), 5)` for the value of `bloom_filter_fp_chance` config option. there are chances that this expression could return a number lower than or equal to 6.71e-05. but we do have a minimum for this option, which is defined by `utils::bloom_calculations::probs`. and the minimum false positive rate is 6.71e-05. we are observing test failures where we are using 0 for the option, and scylla rightly rejected it with the error message of ``` bloom_filter_fp_chance must be larger than 6.71e-05 and less than or equal to 1.0 (got 0) ```. so, in this change, to address the test failure, we always use a number greater than or equal to a value slightly greater than the minimum to ensure that the randomly picked number is in the range of supported false positive rate. Fixes #13313 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13314 (cherry picked from commit `33f4012eeb`)	2023-04-27 19:12:53 +03:00
Beni Peled	429b696bbc	release: prepare for 5.2.0 scylla-5.2.0	2023-04-27 16:26:43 +03:00
Beni Peled	a89867d8c2	release: prepare for 5.2.0-rc5 scylla-5.2.0-rc5	2023-04-25 14:37:54 +03:00
Benny Halevy	6ad94fedf3	utils: clear_gently: do not clear null unique_ptr Otherwise the null pointer is dereferenced. Add a unit test reproducing the issue and testing this fix. Fixes #13636 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `12877ad026`)	2023-04-24 17:51:01 +03:00
Anna Stuchlik	a6188d6abc	doc: document `tombstone_gc` as not experimental The tombstone_gc was documented as experimental in version 5.0. It is no longer experimental in version 5.2. This commit updates the information about the option. Closes #13469 (cherry picked from commit `a68b976c91`)	2023-04-24 11:54:06 +03:00
Botond Dénes	50095cc3a5	Merge 'db: system_keyspace: use microsecond resolution for group0_history range tombstone' from Kamil Braun in `make_group0_history_state_id_mutation`, when adding a new entry to the group 0 history table, if the parameter `gc_older_than` is engaged, we create a range tombstone in the mutation which deletes entries older than the new one by `gc_older_than`. In particular if `gc_older_than = 0`, we want to delete all older entries. There was a subtle bug there: we were using millisecond resolution when generating the tombstone, while the provided state IDs used microsecond resolution. On a super fast machine it could happen that we managed to perform two schema changes in a single millisecond; this happened sometimes in `group0_test.test_group0_history_clearing_old_entries` on our new CI/promotion machines, causing the test to fail because the tombstone didn't clear the entry corresponding to the previous schema change when performing the next schema change (since they happened in the same millisecond). Use microsecond resolution to fix that. The consecutive state IDs used in group 0 mutations are guaranteed to be strictly monotonic at microsecond resolution (see `generate_group0_state_id` in service/raft/raft_group0_client.cc). Fixes #13594 Closes #13604 * github.com:scylladb/scylladb: db: system_keyspace: use microsecond resolution for group0_history range tombstone utils: UUID_gen: accept decimicroseconds in min_time_UUID (cherry picked from commit `10c1f1dc80`)	2023-04-23 16:03:02 +03:00
Botond Dénes	7b2215d8e0	Merge 'Backport bugfixes regarding UDT, UDF, UDA interactions to branch-5.2' from Wojciech Mitros This patch backports https://github.com/scylladb/scylladb/pull/12710 to branch-5.2. To resolve the conflicts that it's causing, it also includes * https://github.com/scylladb/scylladb/pull/12680 * https://github.com/scylladb/scylladb/pull/12681 Closes #13542 * github.com:scylladb/scylladb: uda: change the UDF used in a UDA if it's replaced functions: add helper same_signature method uda: return aggregate functions as shared pointers udf: also check reducefunc to confirm that a UDF is not used in a UDA udf: fix dropping UDFs that share names with other UDFs used in UDAs pytest: add optional argument for new_function argument types udt: disallow dropping a user type used in a user function	2023-04-19 01:38:08 -04:00
Botond Dénes	da9f90362d	Merge 'Compaction reevaluation bug fixes' from Raphael "Raph" Carvalho A problem in compaction reevaluation can cause the SSTable set to be left uncompacted for an indefinite amount of time, potentially causing space and read amplification to be suboptimal. Two reevaluation problems are being fixed, one after off-strategy compaction ended, and another in compaction manager which intends to periodically reevaluate a need for compaction. Fixes https://github.com/scylladb/scylladb/issues/13429. Fixes https://github.com/scylladb/scylladb/issues/13430. Closes #13431 * github.com:scylladb/scylladb: compaction: Make compaction reevaluation actually periodic replica: Reevaluate regular compaction on off-strategy completion (cherry picked from commit `9a02315c6b`)	2023-04-19 01:14:33 -04:00
Botond Dénes	c9a17c80f6	mutation/mutation_compactor: consume_partition_end(): reset _stop The purpose of `_stop` is to remember whether the consumption of the last partition was interrupted or it was consumed fully. In the former case, the compactor allows retrieving the compaction state for the given partition, so that its compaction can be resumed at a later point in time. Currently, `_stop` is set to `stop_iteration::yes` whenever the return value of any of the `consume()` methods is also `stop_iteration::yes`. Meaning, if the consuming of the partition is interrupted, this is remembered in `_stop`. However, a partition whose consumption was interrupted is not always continued later. Sometimes consumption of a partition is interrupted because the partition is not interesting and the downstream consumer wants to stop it. In these cases the compactor should not return an engaged optional from `detach_state()`, because there is no state to detach, the state should be thrown away. This was incorrectly handled so far and is fixed in this patch, by overwriting `_stop` in `consume_partition_end()` with whatever the downstream consumer returns. Meaning if they want to skip the partition, then `_stop` is reset to `stop_iteration::no` and `detach_state()` will return a disengaged optional as it should in this case. Fixes: #12629 Closes #13365 (cherry picked from commit `bae62f899d`)	2023-04-18 02:32:24 -04:00
Wojciech Mitros	7242c42089	uda: change the UDF used in a UDA if it's replaced Currently, if a UDA uses a UDF that's being replaced, the UDA will still keep using the old UDF until the node is restarted. This patch fixes this behavior by checking all UDAs when replacing a UDF and updating them if necessary. Fixes #12709 (cherry picked from commit `02bfac0c66`)	2023-04-17 13:14:46 +02:00
Wojciech Mitros	70ff69afab	functions: add helper same_signature method When deciding whether two functions have the same signature, we have to check if they have the same name and parameter types. Additionally, if they're represented by pointers, we need to check if any of them is a nullptr. This logic is used multiple times, so it's extracted to a separate function. To use this function, the `used_by_user_aggregate` method takes now a function instead of name and types list - we can do it because we always use it with an existing user function (that we're trying to drop). The method will also be useful when we'll be not dropping, but replacing a user function. (cherry picked from commit `58987215dc`)	2023-04-17 13:14:40 +02:00
Wojciech Mitros	5fd4bb853b	uda: return aggregate functions as shared pointers We will want to reuse the functions that we get from an aggregate without making a deep copy, and it's only possible if we get pointers from the aggregate instead of actual values. (cherry picked from commit `20069372e7`)	2023-04-17 13:14:24 +02:00
Wojciech Mitros	313649e86d	udf: also check reducefunc to confirm that a UDF is not used in a UDA When dropping a UDF we're checking if it's not being used in any UDAs and fail otherwise. However, we're only checking its state function and final function, and it may also be used as its reduce function. This patch adds the missing checks and a test for them. (cherry picked from commit `ef1dac813b`)	2023-04-17 13:14:16 +02:00
Wojciech Mitros	14d8cec130	udf: fix dropping UDFs that share names with other UDFs used in UDAs Currently, when dropping a function, we only check if there exists an aggregate that uses a function with the same name as its state function or final function. This may cause the drop to fail even when it's just another UDF with the same name that's used in the aggregate, even when the actual dropped function is not used there. This patch fixes this by checking not only the names of the UDA's sfunc and finalfunc, but also their argument types. (cherry picked from commit `49077dd144`)	2023-04-17 13:14:09 +02:00
Wojciech Mitros	203cbb79a1	pytest: add optional argument for new_function argument types When multiple functions with the same name but different argument types are created, the default drop statement for these functions will fail because it does not include the argument types. With this change, this problem can be worked around by specifying argument types when creating the function, as this will cause the drop statement to include them. (cherry picked from commit `8791b0faf5`)	2023-04-17 13:13:59 +02:00
Wojciech Mitros	51f19d1b8c	udt: disallow dropping a user type used in a user function Currently, nothing prevents us from dropping a user type used in a user function, even though doing so may make us unable to use the function correctly. This patch prevents this behavior by checking all function argument and return types when executing a drop type statement and preventing it from completing if the type is referenced by any of them. (cherry picked from commit `86c61828e6`)	2023-04-17 13:13:35 +02:00
Anna Stuchlik	83735ae77f	doc: update the metrics between 5.2 and 2023.1 Related: https://github.com/scylladb/scylla-enterprise/issues/2794 This commit adds the information about the metric changes in version 2023.1 compared to version 5.2. This commit is part of the 5.2-to-2023.1 upgrade guide and must be backported to branch-5.2. Closes #13506 (cherry picked from commit `989a75b2f7`)	2023-04-17 11:29:43 +02:00
Avi Kivity	9d384e3af2	Merge 'Backport "reader_concurrency_semaphore: don't evict inactive readers needlessly" to branch-5.2' from Botond Dénes The patch doesn't apply cleanly, so a targeted backport PR was necessary. I also needed to cherry-pick two patches from https://github.com/scylladb/scylladb/pull/13255 that the backported patch depends on. Decided against backporting the entire https://github.com/scylladb/scylladb/pull/13255 as it is quite an intrusive change. Fixes: https://github.com/scylladb/scylladb/issues/11803 Closes #13515 * github.com:scylladb/scylladb: reader_concurrency_semaphore: don't evict inactive readers needlessly reader_concurrency_semaphore: add stats to record reason for queueing permits reader_concurrency_semaphore: can_admit_read(): also return reason for rejection	2023-04-17 12:25:21 +03:00
Nadav Har'El	0da0c94f49	cql: USING TTL 0 means unlimited, not default TTL Our documentation states that writing an item with "USING TTL 0" means it should never expire. This should be true even if the table has a default TTL. But Scylla mistakenly handled "USING TTL 0" exactly like having no USING TTL at all (i.e., it took the default TTL, instead of unlimited). We had two xfailing tests demonstrating that Scylla's behavior in this is different from Cassandra. Scylla's behavior in this case was also undocumented. By the way, Cassandra used to have the same bug (CASSANDRA-11207) but it was fixed already in 2016 (Cassandra 3.6). So in this patch we fix Scylla's "USING TTL 0" behavior to match the documentation and Cassandra's behavior since 2016. One xfailing test starts to pass and the second test passes this bug and fails on a different one. This patch also adds a third test for "USING TTL ?" with UNSET_VALUE - it behaves, on both Scylla and Cassandra, like a missing "USING TTL". The origin of this bug was that after parsing the statement, we saved the USING TTL in an integer, and used 0 for the case of no USING TTL given. This meant that we couldn't tell if we have USING TTL 0 or no USING TTL at all. This patch uses an std::optional so we can tell the case of a missing USING TTL from the case of USING TTL 0. Fixes #6447 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13079 (cherry picked from commit `a4a318f394`)	2023-04-17 10:41:08 +03:00
Nadav Har'El	1a9f51b767	cql: fix empty aggregation, and add more tests This patch fixes #12475, where an aggregation (e.g., COUNT(*), MIN(v)) of absolutely no partitions (e.g., "WHERE p = null" or "WHERE p in ()") resulted in an internal error instead of the "zero" result that each aggregator expects (e.g., 0 for COUNT, null for MIN). The problem is that normally our aggregator forwarder picks the nodes which hold the relevant partition(s), forwards the request to each of them, and then combines these results. When there are no partitions, the query is sent to no node, and we end up with an empty result set instead of the "zero" results. So in this patch we recognize this case and build those "zero" results (as mentioned above, these aren't always 0 and depend on the aggregation function!). The patch also adds two tests reproducing this issue in a fairly general way (e.g., several aggregators, different aggregation functions) and confirming the patch fixes the bug. The test also includes two additional tests for COUNT aggregation, which uncovered an incompatibility with Cassandra which is still not fixed - so these tests are marked "xfail": Refs #12477: Combining COUNT with GROUP BY results in empty results in Cassandra, and one result with empty count in Scylla. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12715 (cherry picked from commit `3ba011c2be`)	2023-04-17 10:41:08 +03:00
Raphael S. Carvalho	dba0e604a7	table: Fix disk-space related metrics total disk space used metric is incorrectly telling the amount of disk space ever used, which is wrong. It should tell the size of all sstables being used + the ones waiting to be deleted. live disk space used, by this definition, shouldn't account for the ones waiting to be deleted. and live sstable count, shouldn't account for sstables waiting to be deleted. Fix all that. Fixes #12717. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `529a1239a9`)	2023-04-16 22:14:01 +03:00
Michał Chojnowski	4ea67940cb	locator: token_metadata: get rid of a quadratic behaviour in get_address_ranges() Some callees of update_pending_ranges use the variant of get_address_ranges() which builds a hashmap of all <endpoint, owned range> pairs. For everywhere_topology, the size of this map is quadratic in the number of endpoints, making it big enough to cause contiguous allocations of tens of MiB for clusters of realistic size, potentially causing trouble for the allocator (as seen e.g. in #12724). This deserves a correction. This patch removes the quadratic variant of get_address_ranges() and replaces its uses with its linear counterpart. Refs #10337 Refs #10817 Refs #10836 Refs #10837 Fixes #12724 (cherry picked from commit `9e57b21e0c`)	2023-04-16 21:59:14 +03:00
Jan Ciolek	a8c49c44e5	cql/query_options: add a check for missing bind marker name There was a missing check in validation of named bind markers. Let's say that a user prepares a query like: ```cql INSERT INTO ks.tab (pk, ck, v) VALUES (:pk, :ck, :v) ``` Then they execute the query, but specify only values for `:pk` and `:ck`. We should detect that a value for :v is missing and throw an invalid_request_exception. Until now there was no such check, in case of a missing variable invalid `query_options` were created and Scylla crashed. Sadly it's impossible to create a regression test using `cql-pytest` or `boost`. `cql-pytest` uses the python driver, which silently ignores missing named bind variables, deciding that the user meant to send an UNSET_VALUE for them. When given values like `{'pk': 1, 'ck': 2}`, it will automatically extend them to `{'pk': 1, 'ck': 2, 'v': UNSET_VALUE}`. In `boost` I tried to use `cql_test_env`, but it only has methods which take valid `query_options` as a parameter. I could create a separate unit test for the creation and validation of `query_options` but it won't be a true end-to-end test like `cql-pytest`. The bug was found using the rust driver, the reproducer is available in the issue description. Fixes: #12727 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes #12730 (cherry picked from commit `2a5ed115ca`)	2023-04-16 21:57:28 +03:00
Nadav Har'El	12a29edf90	test/alternator: fix flaky test for partition-tombstone scan The test test_scan.py::test_scan_long_partition_tombstone_string checks that a full-table Scan operation ends a page in the middle of a very long string of partition tombstones, and does NOT scan the entire table in one page (if we did that, getting a single page could take an unbounded amount of time). The test is currently flaky, having failed in CI runs three times in the past two months. The reason for the flakiness is that we don't know exactly how long we need to make the sequence of partition tombstones in the test before we can be absolutely sure a single page will not read this entire sequence. For single-partition scans we have the "query_tombstone_page_limit" configuration parameter, which tells us exactly how long we need to make the sequence of row tombstones. But for a full-table scan of partition tombstones, the situation is more complicated - because the scan is done in parallel on several vnodes and each of them needs to read query_tombstone_page_limit before it stops. In my experiments, using query_tombstone_page_limit * 4 consecutive tombstones was always enough - I ran this test hundreds of times and it didn't fail once. But since it did fail on Jenkins very rarely (3 times in the last two months), maybe the multiplier 4 isn't enough. So this patch doubles it to 8. Hopefully this would be enough for anyone (TM). This makes this test even bigger and slower than it was. To make it faster, I changed this test's write isolation mode from the default always_use_lwt to forbid_rmw (not use LWT). This leaves the test's total run time to be similar to what it was before this patch - around 0.5 seconds in dev build mode on my laptop. Fixes #12817 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12819 (cherry picked from commit `14cdd034ee`)	2023-04-14 11:54:45 +03:00
Botond Dénes	3e10c3fc89	reader_concurrency_semaphore: don't evict inactive readers needlessly Inactive readers should only be evicted to free up resources for waiting readers. Evicting them when waiters are not admitted for any other reason than resources is wasteful and leads to extra load later on when these evicted readers have to be recreated and requeued. This patch changes the logic on both the registering path and the admission path to not evict inactive readers unless there are readers actually waiting on resources. A unit-test is also added, reproducing the overly-aggressive eviction and checking that it doesn't happen anymore. Fixes: #11803 Closes #13286 (cherry picked from commit `bd57471e54`)	2023-04-14 10:37:30 +03:00
Botond Dénes	f11deb5074	reader_concurrency_semaphore: add stats to record reason for queueing permits When diagnosing problems, knowing why permits were queued is very valuable. Record the reason in a new stats, one for each reason a permit can be queued. (cherry picked from commit `7b701ac52e`)	2023-04-14 10:37:30 +03:00
Botond Dénes	1baf9dddd7	reader_concurrency_semaphore: can_admit_read(): also return reason for rejection So caller can bump the appropriate counters or log the reason why the request cannot be admitted. (cherry picked from commit `bb00405818`)	2023-04-14 09:30:02 +03:00
Kamil Braun	9717ff5057	docs: cleaning up after failed membership change After a failed topology operation, like bootstrap / decommission / removenode, the cluster might contain a garbage entry in either token ring or group 0. This entry can be cleaned-up by executing removenode on any other node, pointing to the node that failed to bootstrap or leave the cluster. Document this procedure, including a method of finding the host ID of a garbage entry. Add references in other documents. Fixes: #13122 Closes #13186 (cherry picked from commit `c2a2996c2b`)	2023-04-13 10:35:02 +02:00
Anna Stuchlik	b293b1446f	doc: remove Enterprise upgrade guides from OSS doc This commit removes the Enterprise upgrade guides from the Open Source documentation. The Enterprise upgrade guides should only be available in the Enterprise documentation, with the source files stored in scylla-enterprise.git. In addition, this commit: - adds the links to the Enterprise user guides in the Enterprise documentation at https://enterprise.docs.scylladb.com/ - adds the redirections for the removed pages to avoid breaking any links. This commit must be reverted in scylla-enterprise.git. (cherry picked from commit `61bc05ae49`) Closes #13473	2023-04-11 14:26:35 +03:00
Yaron Kaikov	e6f7ac17f6	doc: update supported os for 2022.1 ubuntu22.04 is already supported on both `5.0` and `2022.1` updating the table Closes #13340 (cherry picked from commit `c80ab78741`)	2023-04-05 13:56:07 +03:00
Anna Stuchlik	36619fc7d9	doc: add upgrade guide from 5.2 to 2023.1 Related: https://github.com/scylladb/scylla-enterprise/issues/2770 This commit adds the upgrade guide from ScyllaDB Open Source 5.2 to ScyllaDB Enterprise 2023.1. This commit does not cover metric updates (the metrics file has no content, which needs to be added in another PR). As this is an upgrade guide, this commit must be merged to master and backported to branch-5.2 and branch-2023.1 in scylla-enterprise.git. Closes #13294 (cherry picked from commit `595325c11b`)	2023-04-05 06:43:01 +03:00
Anna Stuchlik	750414c196	doc: update Raft doc for versions 5.2 and 2023.1 Fixes https://github.com/scylladb/scylladb/issues/13345 Fixes https://github.com/scylladb/scylladb/issues/13421 This commit updates the Raft documentation page to be up to date in versions 5.2 and 2023.1. - Irrelevant information about previous releases is removed. - Some information is clarified. - Mentions of version 5.2 are either removed (if possible) or version 2023.1 is added. Closes #13426 (cherry picked from commit `447ce58da5`)	2023-04-05 06:42:28 +03:00
Botond Dénes	128050e984	Merge 'commitlog: Fix updating of total_size_on_disk on segment alloc when o_dsync is off' from Calle Wilund Fixes #12810 We did not update total_size_on_disk in commitlog totals when use o_dsync was off. This means we essentially ran with no registered footprint, also causing broken comparisons in delete_segments. Closes #12950 * github.com:scylladb/scylladb: commitlog: Fix updating of total_size_on_disk on segment alloc when o_dsync is off commitlog: change type of stored size (cherry picked from commit `e70be47276`)	2023-04-03 08:57:43 +03:00
Yaron Kaikov	d70751fee3	release: prepare for 5.2.0-rc4 scylla-5.2.0-rc4	2023-04-02 16:40:56 +03:00
Tzach Livyatan	1fba43c317	docs: minor improvements to the Raft Handling Failures and recovery procedure sections Closes #13292 (cherry picked from commit `46e6c639d9`)	2023-03-31 11:22:20 +02:00
Botond Dénes	e380c24c69	Merge 'Improve database shutdown verbosity' from Pavel Emelyanov The `database::stop` method is sometimes hanging and it's always hard to spot where exactly it sleeps. A few more logging messages would make this much simpler. refs: #13100 refs: #10941 Closes #13141 * github.com:scylladb/scylladb: database: Increase verbosity of database::stop() method large_data_handler: Increase verbosity on shutdown large_data_handler: Coroutinize .stop() method (cherry picked from commit `e22b27a107`)	2023-03-30 17:01:24 +03:00
Avi Kivity	76a76a95f4	Update tools/java submodule (hdrhistogram with Java 11) * tools/java 1c4e1e7a7d...83b2168b19 (1): > Fix cassandra-stress -log hdrfile=... with java 11 Fixes #13287	2023-03-29 14:10:27 +03:00
Anna Stuchlik	f6837afec7	doc: update the Ubuntu version used in the image Starting from 5.2 and 2023.1 our images are based on Ubuntu:22.04. See https://github.com/scylladb/scylladb/issues/13138#issuecomment-1467737084 This commit adds that information to the docs. It should be merged and backported to branch-5.2. Closes #13301 (cherry picked from commit `9e27f6b4b7`)	2023-03-27 14:08:57 +03:00
Botond Dénes	6350c8836d	Revert "repair: Reduce repair reader eviction with diff shard count" This reverts commit `c6087cf3a0`. Said commit can cause a deadlock when 2 or more repairs compete for locks on 2 or more nodes. Consider the following scenario: Node n1 and n2 in the cluster, 1 shard per node, rf = 2, each shard has 1 available unit for the reader lock n1 starts repair r1 r1-n1 (instance of r1 on node1) takes the reader lock on node1 n2 starts repair r2 r2-n2 (instance of r2 on node2) takes the reader lock on node2 r1-n2 will fail to take the reader lock on node2 r2-n1 will fail to take the reader lock on node1 As a result, r1 and r2 could not make progress and deadlock happens. The complexity comes from the fact that a repair job needs lock on more than one node. It is not guaranteed that all the participant nodes could take the lock in one shot. There is no simple solution to this so we have to revert this locking mechanism and look for another way to prevent reader thrashing when repairing nodes with mismatching shard count. Fixes: #12693 Closes #13266 (cherry picked from commit `7699904c54`)	2023-03-24 09:44:16 +02:00
Avi Kivity	5457948437	Update seastar submodule (rpc cancellation during negotiation) * seastar 8889cbc198...1488aaf842 (1): > Merge 'Keep outgoing queue all cancellable while negotiating (again)' from Pavel Emelyanov Fixes #11507.	2023-03-23 17:15:00 +02:00
Avi Kivity	da41001b5c	.gitmodules: point seastar submodule at scylla-seastar.git This allows us to backport seastar commits. Ref #11507.	2023-03-23 17:11:43 +02:00
Anna Stuchlik	dd61e8634c	doc: related https://github.com/scylladb/scylladb/issues/12754 ; add the missing information about reporting latencies to the upgrade guide 5.1 to 5.2 Closes #12935 (cherry picked from commit `26bb36cdf5`)	2023-03-22 10:38:28 +02:00
Anna Stuchlik	b642b4c30e	doc: fix the service name in upgrade guides Fixes https://github.com/scylladb/scylladb/issues/13207 This commit fixes the service and package names in the upgrade guides 5.0-to-2022.1 and 5.1-to-2022.2. Service name: scylla-server Package name: scylla-enterprise Previous PRs to fix the same issue in other upgrade guides: https://github.com/scylladb/scylladb/pull/12679 https://github.com/scylladb/scylladb/pull/12698 This commit must be backported to branch-5.1 and branch 5.2. Closes #13225 (cherry picked from commit `922f6ba3dd`)	2023-03-22 10:37:12 +02:00
Botond Dénes	c013336121	db/view/view_update_check: check_needs_view_update_path(): filter out non-member hosts We currently don't clean up the system_distributed.view_build_status table after removed nodes. This can cause a false-positive check for whether view update generation is needed for streaming. The proper fix is to clean up this table, but that will be more involved, and even when done, it might not be immediate. So until then and to be on the safe side, filter out entries belonging to unknown hosts from said table. Fixes: #11905 Refs: #11836 Closes #11860 (cherry picked from commit `84a69b6adb`)	2023-03-22 09:03:50 +02:00
Kamil Braun	b6b35ce061	service: storage_proxy: sequence CDC preimage select with Paxos learn `paxos_response_handler::learn_decision` was calling `cdc_service::augment_mutation_call` concurrently with `storage_proxy::mutate_internal`. `augment_mutation_call` was selecting rows from the base table in order to create the preimage, while `mutate_internal` was writing rows to the table. It was therefore possible for the preimage to observe the update that it accompanied, which doesn't make any sense, because the preimage is supposed to show the state before the update. Fix this by performing the operations sequentially. We can still perform the CDC mutation write concurrently with the base mutation write. `cdc_with_lwt_test` was sometimes failing in debug mode due to this bug and was marked flaky. Unmark it. Fixes #12098 (cherry picked from commit `1ef113691a`)	2023-03-21 20:23:19 +02:00
Petr Gusev	069e38f02d	transport server: fix unexpected server errors handling If request processing ended with an error, it is worth sending the error to the client through make_error/write_response. Previously in this case we just wrote a message to the log and didn't handle the client connection in any way. As a result, the only thing the client got in this case was timeout error. A new test_batch_with_error is added. It is quite difficult to reproduce error condition in a test, so we use error injection instead. Passing injection_key in the body of the request ensures that the exception will be thrown only for this test request and will not affect other requests that the driver may send in the background. Closes: scylladb#12104 (cherry picked from commit `a4cf509c3d`)	2023-03-21 20:23:09 +02:00
Anna Mikhlin	61a8003ad1	release: prepare for 5.2.0-rc3 scylla-5.2.0-rc3	2023-03-20 10:10:27 +02:00
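
The retry approach described in commit bf7fc9709d (the toppartitions test fix) can be sketched roughly as follows. This is only an illustration, not the code from the patch: the helper name `retry_with_increasing_duration`, the `attempt` callback, and the doubling factor are assumptions. Only the 0.01-second starting duration and the 5-second cap come from the commit message.

```python
def retry_with_increasing_duration(attempt, first=0.01, limit=5.0, factor=2.0):
    """Call attempt(duration) with growing durations until it yields a result.

    `attempt` is a hypothetical callback that returns None when the sampling
    duration was too short, and any other value on success.
    The doubling factor is an assumption; the commit only says "increasing".
    """
    duration = first
    while True:
        result = attempt(duration)
        if result is not None:
            return result
        if duration >= limit:
            # Even the longest allowed duration failed: report a real failure.
            raise TimeoutError(f"no result even with a {limit}-second duration")
        # Grow the duration, but never beyond the cap.
        duration = min(duration * factor, limit)
```

With these assumed parameters a fast machine succeeds on one of the first short attempts, while a persistently failing test makes ten attempts, with durations adding up to roughly ten seconds, before giving up.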

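The root cause described in commit 0da0c94f49 ("USING TTL 0 means unlimited") is a sentinel collision: the integer 0 stood for both "no USING TTL given" and "USING TTL 0". The actual fix is in C++ and uses `std::optional`. The Python sketch below only mirrors the idea with `Optional[int]`, and both function names are hypothetical.

```python
from typing import Optional

def effective_ttl_buggy(using_ttl: int, default_ttl: int) -> int:
    """Old behaviour (sketch): 0 doubles as "no USING TTL given"."""
    # USING TTL 0 is indistinguishable from a missing clause here,
    # so the table's default TTL is applied by mistake.
    return default_ttl if using_ttl == 0 else using_ttl

def effective_ttl_fixed(using_ttl: Optional[int], default_ttl: int) -> int:
    """Fixed behaviour (sketch): None means "no USING TTL given".

    A returned TTL of 0 means the item never expires.
    """
    return default_ttl if using_ttl is None else using_ttl
```

For a table with a default TTL of 3600 seconds, the buggy variant maps `USING TTL 0` to 3600, while the fixed variant returns 0 (never expires) and still falls back to 3600 when the clause is absent.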

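The clamping described in commit 04424f8956 (the `bloom_filter_fp_chance` test fix) could look something like the sketch below. The minimum 6.71e-05 is taken from the error message quoted in the commit. The floor value `LOWEST_TESTED`, the function name, and the `rng` parameter are assumptions made for illustration, and the real test may pick its value differently.

```python
import random

# Exclusive lower bound quoted in the Scylla error message.
MIN_FP_CHANCE = 6.71e-05
# Hypothetical floor, slightly above the minimum, used by this sketch only.
LOWEST_TESTED = 7e-05

def random_fp_chance(rng=random.random):
    """Pick a random false-positive chance that Scylla would accept.

    round(rng(), 5) alone can yield 0 or another value at or below the
    minimum, so the result is clamped from below.
    """
    return max(round(rng(), 5), LOWEST_TESTED)
```

Passing a fixed `rng` such as `lambda: 0.0` shows the clamp at work: the unclamped expression would yield 0, which the server rejects.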
34732 Commits