scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-29 11:10:40 +00:00

Author	SHA1	Message	Date
Tzach Livyatan	bb3751334c	Remove Ubuntu 18.04 support from 5.2 Ubuntu [18.04 will be soon out of standard support](https://ubuntu.com/blog/18-04-end-of-standard-support), and can be removed from 5.2 supported list https://github.com/scylladb/scylla-pkg/issues/3346 Closes #13529 (cherry picked from commit `e655060429`)	2023-05-30 16:25:42 +03:00
Beni Peled	9dd70a58c3	release: prepare for 5.2.2 scylla-5.2.2	2023-05-18 14:03:20 +03:00
Anna Stuchlik	0bc6694ac5	doc: fix the links to the Enterprise docs Fixes https://github.com/scylladb/scylladb/issues/13915 This commit fixes broken links to the Enterprise docs. They are links to the enterprise branch, which is not published. The links to the Enterprise docs should include "stable" instead of the branch name. This commit must be backported to branch-5.2, because the broken links are present in the published 5.2 docs. Closes #13917 (cherry picked from commit `6f4a68175b`)	2023-05-18 08:40:02 +03:00
Botond Dénes	486483b379	Merge '[Backport 5.2]: node ops backports' from Benny Halevy This branch backports to branch-5.2 several fixes related to node operations: - `ba919aa88a` (PR #12980; Fixes: #11011, #12969) - `53636167ca` (part of PR #12970; Fixes: #12764, #12956) - `5856e69462` (part of PR #12970) - `2b44631ded` (PR #13028; Fixes: #12989) - `6373452b31` (PR #12799; Fixes #12798) Closes #13531 * github.com:scylladb/scylladb: Merge 'Do not mask node operation errors' from Benny Halevy Merge 'storage_service: Make node operations safer by detecting asymmetric abort' from Tomasz Grabiec storage_service: Wait for normal state handler to finish in replace storage_service: Wait for normal state handler to finish in bootstrap storage_service: Send heartbeat earlier for node ops	2023-05-17 16:46:49 +03:00
Tzach Livyatan	9afaec5b12	Update Azure recommended instances type from the Lsv2-series to the Lsv3-series Closes #13835 (cherry picked from commit `a73fde6888`)	2023-05-17 15:41:47 +03:00
Anna Stuchlik	9c99dc36b5	doc: add OS support for version 2023.1 Fixes https://github.com/scylladb/scylladb/issues/13857 This commit adds the OS support for ScyllaDB Enterprise 2023.1. The support is the same as for ScyllaDB Open Source 5.2, on which 2023.1 is based. After this commit is merged, it must be backported to branch-5.2. In this way, it will be merged to branch-2023.1 and available in the docs for Enterprise 2023.1 Closes: #13858 (cherry picked from commit `84ed95f86f`)	2023-05-16 10:11:21 +03:00
Tomasz Grabiec	548a7f73d3	Merge 'range_tombstone_change_generator: fix an edge case in flush()' from Michał Chojnowski range_tombstone_change_generator::flush() mishandles the case when two range tombstones are adjacent and flush(pos, end_of_range=true) is called with pos equal to the end bound of the lesser-position range tombstone. In such case, the start change of the greater-position rtc will be accidentally emitted, and there won't be an end change, which breaks reader assumptions by ending the stream with an unclosed range tombstone, triggering an assertion. This is due to a non-strict inequality used in a place where strict inequality should be used. The modified line was intended to close range tombstones which end exactly on the flush position, but this is unnecessary because such range tombstones are handled by the last `if` in the function anyway. Instead, this line caused range tombstones beginning right after the flush position to be emitted sometimes. Fixes https://github.com/scylladb/scylladb/issues/12462 Closes #13894 * github.com:scylladb/scylladb: tests: row_cache: Add reproducer for reader producing missing closing range tombstone range_tombstone_change_generator: fix an edge case in flush()	2023-05-15 23:29:08 +02:00
Raphael S. Carvalho	5c66875dbe	sstables: Fix use-after-move when making reader in reverse mode static report: sstables/mx/reader.cc:1705:58: error: invalid invocation of method 'operator' on object 'schema' while it is in the 'consumed' state [-Werror,-Wconsumed] legacy_reverse_slice_to_native_reverse_slice(schema, slice.get()), pc, std::move(trace_state), fwd, fwd_mr, monitor); Fixes #13394. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `213eaab246`)	2023-05-15 20:27:34 +03:00
Raphael S. Carvalho	26b4d2c3c1	db/view/build_progress_virtual_reader: Fix use-after-move use-after-free in ctor, which potentially leads to a failure when locating table from moved schema object. static report In file included from db/system_keyspace.cc:51: ./db/view/build_progress_virtual_reader.hh:202:40: warning: invalid invocation of method 'operator->' on object 's' while it is in the 'consumed' state [-Wconsumed] _db.find_column_family(s->ks_name(), system_keyspace::v3::SCYLLA_VIEWS_BUILDS_IN_PROGRESS), Fixes #13395. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `1ecba373d6`)	2023-05-15 20:26:01 +03:00
Raphael S. Carvalho	874062b72a	index/built_indexes_virtual_reader.hh: Fix use-after-move static report: ./index/built_indexes_virtual_reader.hh:228:40: warning: invalid invocation of method 'operator->' on object 's' while it is in the 'consumed' state [-Wconsumed] _db.find_column_family(s->ks_name(), system_keyspace::v3::BUILT_VIEWS), Fixes #13396. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `f8df3c72d4`)	2023-05-15 20:24:35 +03:00
Raphael S. Carvalho	71ec750a59	replica: Fix use-after-move in table::make_streaming_reader Variant used by streaming/stream_transfer_task.cc: , reader(cf.make_streaming_reader(cf.schema(), std::move(permit_), prs)) as full slice is retrieved after schema is moved (clang evaluates left-to-right), the stream transfer task can be potentially working on a stale slice for a particular set of partitions. static report: In file included from replica/dirty_memory_manager.cc:6: replica/database.hh:706:83: error: invalid invocation of method 'operator->' on object 'schema' while it is in the 'consumed' state [-Werror,-Wconsumed] return make_streaming_reader(std::move(schema), std::move(permit), range, schema->full_slice()); Fixes #13397. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `04932a66d3`)	2023-05-15 20:21:48 +03:00
Tomasz Grabiec	7c1bdc6553	tests: row_cache: Add reproducer for reader producing missing closing range tombstone Adds a reproducer for #12462. The bug manifests by reader throwing: std::logic_error: Stream ends with an active range tombstone: {range_tombstone_change: pos={position: clustered,ckp{},-1}, {tombstone: timestamp=-9223372036854775805, deletion_time=2}} The reason is that prior to the fix range_tombstone_change_generator::flush() was used with end_of_range=true to produce the closing range_tombstone_change and it did not handle correctly the case when there are two adjacent range tombstones and flush(pos, end_of_range=true) is called such that pos is the boundary between the two. Cherry-picked from `a717c803c7`.	2023-05-15 18:02:40 +02:00
Michał Chojnowski	24d966f806	range_tombstone_change_generator: fix an edge case in flush() range_tombstone_change_generator::flush() mishandles the case when two range tombstones are adjacent and flush(pos, end_of_range=true) is called with pos equal to the end bound of the lesser-position range tombstone. In such case, the start change of the greater-position rtc will be accidentally emitted, and there won't be an end change, which breaks reader assumptions by ending the stream with an unclosed range tombstone, triggering an assertion. This is due to a non-strict inequality used in a place where strict inequality should be used. The modified line was intended to close range tombstones which end exactly on the flush position, but this is unnecessary because such range tombstones are handled by the last `if` in the function anyway. Instead, this line caused range tombstones beginning right after the flush position to be emitted sometimes. Fixes #12462	2023-05-15 17:48:24 +02:00
Asias He	05a3a1bf55	tombstone_gc: Fix gc_before for immediate mode The immediate mode is similar to timeout mode with gc_grace_seconds zero. Thus, the gc_before returned should be the query_time instead of gc_clock::time_point::max in immediate mode. Setting gc_before to gc_clock::time_point::max, a row could be dropped by compaction even if the ttl is not expired yet. The following procedure reproduces the issue: - Start 2 nodes - Insert data ``` CREATE KEYSPACE ks2a WITH REPLICATION = { 'class' : 'SimpleStrategy', 'replication_factor' : 2 }; CREATE TABLE ks2a.tb (pk int, ck int, c0 text, c1 text, c2 text, PRIMARY KEY(pk, ck)) WITH tombstone_gc = {'mode': 'immediate'}; INSERT into ks2a.tb (pk,ck, c0, c1, c2) values (10 ,1, 'x', 'y', 'z') USING TTL 1000000; INSERT into ks2a.tb (pk,ck, c0, c1, c2) values (20 ,1, 'x', 'y', 'z') USING TTL 1000000; INSERT into ks2a.tb (pk,ck, c0, c1, c2) values (30 ,1, 'x', 'y', 'z') USING TTL 1000000; ``` - Run nodetool flush and nodetool compact - Compaction drops all data ``` ~128 total partitions merged to 0. ``` Fixes #13572 Closes #13800 (cherry picked from commit `7fcc403122`)	2023-05-15 10:33:29 +03:00
Takuya ASADA	f148a6be1d	scylla_kernel_check: suppress verbose iotune messages Stop printing verbose iotune messages while the check, just print error message. Fixes #13373. Closes #13362 (cherry picked from commit `160c184d0b`)	2023-05-14 21:25:57 +03:00
Benny Halevy	5785550e24	view: view_builder: start: demote sleep_aborted log error This is not really an error, so print it in debug log_level rather than error log_level. Fixes #13374 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #13462 (cherry picked from commit `cc42f00232`)	2023-05-14 21:21:59 +03:00
Avi Kivity	401de17c82	Update seastar submodule (condition_variable tasktrace fix) * seastar aa46b980ec...98504c4bb6 (1): > condition-variable: replace the coroutine wakeup task with a promise Fixes #13368	2023-05-14 21:12:12 +03:00
Raphael S. Carvalho	94c9553e8a	Fix use-after-move when initializing row cache with dummy entry Courtersy of clang-tidy: row_cache.cc:1191:28: warning: 'entry' used after it was moved [bugprone-use-after-move] _partitions.insert(entry.position().token().raw(), std::move(entry), dht::ring_position_comparator{_schema}); ^ row_cache.cc:1191:60: note: move occurred here _partitions.insert(entry.position().token().raw(), std::move(entry), dht::ring_position_comparator{_schema}); ^ row_cache.cc:1191:28: note: the use and move are unsequenced, i.e. there is no guarantee about the order in which they are evaluated _partitions.insert(entry.position().token().raw(), std::move(entry), dht::ring_position_comparator{*_schema}); The use-after-move is UB, as for it to happen, depends on evaluation order. We haven't hit it yet as clang is left-to-right. Fixes #13400. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13401 (cherry picked from commit `d2d151ae5b`)	2023-05-14 21:02:24 +03:00
Anna Mikhlin	f1c45553bc	release: prepare for 5.2.1 scylla-5.2.1	2023-05-08 22:15:46 +03:00
Botond Dénes	1a288e0a78	Update seastar submodule * seastar 1488aaf8...aa46b980 (1): > core/on_internal_error: always log error with backtrace Fixes: #13786	2023-05-08 10:30:10 +03:00
Marcin Maliszkiewicz	a2fed1588e	db: view: use deferred_close for closing staging_sstable_reader When consume_in_thread throws the reader should still be closed. Related https://github.com/scylladb/scylla-enterprise/issues/2661 Closes #13398 Refs: scylladb/scylla-enterprise#2661 Fixes: #13413 (cherry picked from commit `99f8d7dcbe`)	2023-05-08 09:41:07 +03:00
Botond Dénes	f07a06d390	Merge 'service:forward_service: use long type instead of counter in function mocking' from Michał Jadwiszczak Aggregation query on counter column is failing because forward_service is looking for function with counter as an argument and such function doesn't exist. Instead the long type should be used. Fixes: #12939 Closes #12963 * github.com:scylladb/scylladb: test:boost: counter column parallelized aggregation test service:forward_service: use long type when column is counter (cherry picked from commit `61e67b865a`)	2023-05-07 14:27:29 +03:00
Anna Stuchlik	4ec531d807	doc: remove the sequential repair option from docs Fixes https://github.com/scylladb/scylladb/issues/12132 The sequential repair mode is not supported. This commit removes the incorrect information from the documentation. Closes #13544 (cherry picked from commit `3d25edf539`)	2023-05-07 14:27:29 +03:00
Asias He	4867683f80	storage_service: Fix removing replace node as pending Consider - n1, n2, n3 - n3 is down - n4 replaces n3 with the same ip address 127.0.0.3 - Inside the storage_service::handle_state_normal callback for 127.0.0.3 on n1/n2 ``` auto host_id = _gossiper.get_host_id(endpoint); auto existing = tmptr->get_endpoint_for_host_id(host_id); ``` host_id = new host id existing = empty As a result, del_replacing_endpoint() will not be called. This means 127.0.0.3 will not be removed as a pending node on n1 and n2 when replacing is done. This is wrong. This is a regression since commit `9942c60d93` (storage_service: do not inherit the host_id of a replaced a node), where replacing node uses a new host id than the node to be replaced. To fix, call del_replacing_endpoint() when a node becomes NORMAL and existing is empty. Before: n1: storage_service - replace[cd1f187a-0eee-4b04-91a9-905ecc499cfc]: Added replacing_node=127.0.0.3 to replace existing_node=127.0.0.3, coordinator=127.0.0.3 token_metadata - Added node 127.0.0.3 as pending replacing endpoint which replaces existing node 127.0.0.3 storage_service - replace[cd1f187a-0eee-4b04-91a9-905ecc499cfc]: Marked ops done from coordinator=127.0.0.3 storage_service - Node 127.0.0.3 state jump to normal storage_service - Set host_id=6f9ba4e8-9457-4c76-8e2a-e2be257fe123 to be owned by node=127.0.0.3 After: n1: storage_service - replace[28191ea6-d43b-3168-ab01-c7e7736021aa]: Added replacing_node=127.0.0.3 to replace existing_node=127.0.0.3, coordinator=127.0.0.3 token_metadata - Added node 127.0.0.3 as pending replacing endpoint which replaces existing node 127.0.0.3 storage_service - replace[28191ea6-d43b-3168-ab01-c7e7736021aa]: Marked ops done from coordinator=127.0.0.3 storage_service - Node 127.0.0.3 state jump to normal token_metadata - Removed node 127.0.0.3 as pending replacing endpoint which replaces existing node 127.0.0.3 storage_service - Set host_id=72219180-e3d1-4752-b644-5c896e4c2fed to be owned by node=127.0.0.3 Tests: https://github.com/scylladb/scylla-dtest/pull/3126 Closes #13677 Fixes: https://github.com/scylladb/scylla-enterprise/issues/2852 (cherry picked from commit `a8040306bb`)	2023-05-03 14:15:13 +03:00
Botond Dénes	0e42defe06	readers: evictable_reader: skip progress guarantee when next pos is partition start The evictable reader must ensure that each buffer fill makes forward progress, i.e. the last fragment in the buffer has a position larger than the last fragment from the last buffer-fill. Otherwise, the reader could get stuck in an infinite loop between buffer fills, if the reader is evicted in-between. The code guranteeing this forward change has a bug: when the next expected position is a partition-start (another partition), the code would loop forever, effectively reading all there is from the underlying reader. To avoid this, add a special case to ignore the progress guarantee loop altogether when the next expected position is a partition start. In this case, progress is garanteed anyway, because there is exactly one partition-start fragment in each partition. Fixes: #13491 Closes #13563 (cherry picked from commit `72003dc35c`)	2023-05-02 21:58:41 +03:00
Avi Kivity	f73d017f05	tools: toolchain: regenerate Fixes #13744	2023-05-02 13:16:59 +03:00
Pavel Emelyanov	3723678b82	scylla-gdb: Parse and eval _all_threads without quotes I've no idea why the quotes are there at all, it works even without them. However, with quotes gdb-13 fails to find the _all_threads static thread-local variable _unless_ it's printed with gdb "p" command beforehand. fixes: #13125 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13132 (cherry picked from commit `537510f7d2`)	2023-05-02 13:16:59 +03:00
Botond Dénes	ea506f50cc	Merge 'Do not mask node operation errors' from Benny Halevy This series handles errors when aborting node operations and prints them rather letting them leak and be exposed to the user. Also, cleanup the node_ops logging formats when aborting different node ops and add more error logging around errors in the "worker" nodes. Closes #12799 * github.com:scylladb/scylladb: storage_service: node_ops_signal_abort: print a warning when signaling abort storage_service: s/node_ops_singal_abort/node_ops_signal_abort/ storage_service: node_ops_abort: add log messages storage_service: wire node_ops_ctl for node operations storage_service: add node_ops_ctl class to formalize all node_ops flow repair: node_ops_cmd_request: add print function repair: do_decommission_removenode_with_repair: log ignore_nodes repair: replace_with_repair: get ignore_nodes as unordered_set gossiper: get_generation_for_nodes: get nodes as unordered_set storage_service: don't let node_ops abort failures mask the real error (cherry picked from commit `6373452b31`)	2023-04-30 18:58:28 +03:00
Kamil Braun	42fd3704e4	Merge 'storage_service: Make node operations safer by detecting asymmetric abort' from Tomasz Grabiec This patch fixes a problem which affects decommission and removenode which may lead to data consistency problems under conditions which lead one of the nodes to unliaterally decide to abort the node operation without the coordinator noticing. If this happens during streaming, the node operation coordinator would proceed to make a change in the gossiper, and only later dectect that one of the nodes aborted during sending of decommission_done or removenode_done command. That's too late, because the operation will be finalized by all the nodes once gossip propagates. It's unsafe to finalize the operation while another node aborted. The other node reverted to the old topolgy, with which they were running for some time, without considering the pending replica when handling requests. As a result, we may end up with consistency issues. Writes made by those coordinators may not be replicated to CL replicas in the new topology. Streaming may have missed to replicate those writes depending on timing. It's possible that some node aborts but streaming succeeds if the abort is not due to network problems, or if the network problems are transient and/or localized and affect only heartbeats. There is no way to revert after we commit the node operation to the gossiper, so it's ok to close node_ops sessions before making the change to the gossiper, and thus detect aborts and prevent later aborts after the change in the gossiper is made. This is already done during bootstrap (RBNO enabled) and replacenode. This patch canges removenode to also take this approach by moving sending of remove_done earlier. We cannot take this approach with decommission easily, because decommission_done command includes a wait for the node to leave the ring, which won't happen before the change to the gossiper is made. Separating this from decommission_done would require protocol changes. This patch adds a second-best solution, which is to check if sessions are still there right before making a change to the gossiper, leaving decommission_done where it was. The race can still happen, but the time window is now much smaller. The PR also lays down infrastructure which enables testing the scenarios. It makes node ops watchdog periods configurable, and adds error injections. Fixes #12989 Refs #12969 Closes #13028 * github.com:scylladb/scylladb: storage_service: node ops: Extract node_ops_insert() to reduce code duplication storage_service: Make node operations safer by detecting asymmetric abort storage_service: node ops: Add error injections service: node_ops: Make watchdog and heartbeat intervals configurable (cherry picked from commit `2b44631ded`)	2023-04-30 18:58:28 +03:00
Asias He	c9d19b3595	storage_service: Wait for normal state handler to finish in replace Similar to "storage_service: Wait for normal state handler to finish in bootstrap", this patch enables the check on the replace procedure. (cherry picked from commit `5856e69462`)	2023-04-30 18:58:28 +03:00
Asias He	9a873bf4b3	storage_service: Wait for normal state handler to finish in bootstrap In storage_service::handle_state_normal, storage_service::notify_joined will be called which drops the rpc connections to the node becomes normal. This causes rpc calls with that node fail with seastar::rpc::closed_error error. Consider this: - n1 in the cluster - n2 is added to join the cluster - n2 sees n1 is in normal status - n2 starts bootstrap process - notify_joined on n2 closes rpc connection to n1 in the middle of bootstrap - n2 fails to bootstrap For example, during bootstrap with RBNO, we saw repair failed in a test that sets ring_delay to zero and does not wait for gossip to settle. repair - repair[9cd0dbf8-4bca-48fc-9b1c-d9e80d0313a2]: sync data for keyspace=system_distributed_everywhere, status=failed: std::runtime_error ({shard 0: seastar::rpc::closed_error (connection is closed)}) This patch fixes the race by waiting for the handle_state_normal handler to finish before the bootstrap process. Fixes #12764 Fixes #12956 (cherry picked from commit `53636167ca`)	2023-04-30 18:58:28 +03:00
Asias He	51a00280a2	storage_service: Send heartbeat earlier for node ops Node ops has the following procedure: 1 for node in sync_nodes send prepare cmd to node 2 for node in sync_nodes send heartbeat cmd to node If any of the prepare cmd in step 1 takes longer than the heartbeat watchdog timeout, the heartbeat in step 2 will be too late to update the watchdog, as a result the watchdog will abort the operation. To prevent slow prepare cmd kills the node operations, we can start the heartbeat earlier in the procedure. Fixes #11011 Fixes #12969 Closes #12980 (cherry picked from commit `ba919aa88a`)	2023-04-30 18:58:28 +03:00
Wojciech Mitros	b0a7c02e09	rust: update dependencies Cranelift-codegen 0.92.0 and wasmtime 5.0.0 have security issues potentially allowing malicious UDFs to read some memory outside the wasm sandbox. This patch updates them to versions 0.92.1 and 5.0.1 respectively, where the issues are fixed. Fixes #13157 Closes #13171 (cherry picked from commit `aad2afd417`)	2023-04-27 22:01:44 +03:00
Wojciech Mitros	f18c49dcc6	rust: update dependencies Wasmtime added some improvements in recent releases - particularly, two security issues were patched in version 2.0.2. There were no breaking changes for our use other than the strategy of returning Traps - all of them are now anyhow::Errors instead, but we can still downcast to them, and read the corresponding error message. The cxx, anyhow and futures dependency versions now match the versions saved in the Cargo.lock. Closes #12830 (cherry picked from commit `8b756cb73f`) Ref #13157	2023-04-27 22:00:54 +03:00
Anna Stuchlik	35dfec78d1	doc: fixes https://github.com/scylladb/scylladb/issues/12964 , removes the information that the CDC options are experimental Closes #12973 (cherry picked from commit `4dd1659d0b`)	2023-04-27 21:06:49 +03:00
Raphael S. Carvalho	dbd8ca4ade	replica: Fix undefined behavior in table::generate_and_propagate_view_updates() Undefined behavior because the evaluation order is undefined. With GCC, where evaluation is right-to-left, schema will be moved once it's forwarded to make_flat_mutation_reader_from_mutations_v2(). The consequence is that memory tracking of mutation_fragment_v2 (for tracking only permit used by view update), which uses the schema, can be incorrect. However, it's more likely that Scylla will crash when estimating memory usage for row, which access schema column information using schema::column_at(), which in turn asserts that the requested column does really exist. Fixes #13093. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13092 (cherry picked from commit `3fae46203d`)	2023-04-27 19:56:38 +03:00
Anna Stuchlik	1be4afb842	doc: remove incorrect info about BYPASS CACHE Fixes https://github.com/scylladb/scylladb/issues/13106 This commit removes the information that BYPASS CACHE is an Enterprise-only feature and replaces that info with the link to the BYPASS CACHE description. Closes #13316 (cherry picked from commit `1cfea1f13c`)	2023-04-27 19:54:04 +03:00
Kefu Chai	7cc9f5a05f	dist/redhat: enforce dependency on %{release} also * tools/python3 279b6c1...cf7030a (1): > dist: redhat: provide only a single version s/%{version}/%{version}-%{release}/ in `Requires:` sections. this enforces the runtime dependencies of exactly the same releases between scylla packages. Fixes #13222 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> (cherry picked from commit `7165551fd7`)	2023-04-27 19:27:34 +03:00
Nadav Har'El	bf7fc9709d	test/rest_api: fix flaky test for toppartitions The REST test test_storage_service.py::test_toppartitions_pk_needs_escaping was flaky. It tests the toppartition request, which unfortunately needs to choose a sampling duration in advance, and we chose 1 second which we considered more than enough - and indeed typically even 1ms is enough! but very rarely (only know of only one occurance, in issue #13223) one second is not enough. Instead of increasing this 1 second and making this test even slower, this patch takes a retry approach: The tests starts with a 0.01 second duration, and is then retried with increasing durations until it succeeds or a 5-seconds duration is reached. This retry approach has two benefits: 1. It de-flakes the test (allowing a very slow test to take 5 seconds instead of 1 seconds which wasn't enough), and 2. At the same time it makes a successful test much faster (it used to always take a full second, now it takes 0.07 seconds on a dev build on my laptop). A failed test may, in some cases, take 10 seconds after this patch (although in some other cases, an error will be caught immediately), but I consider this acceptable - this test should pass, after all, and a failure indicates a regression and taking 10 seconds will be the last of our worries in that case. Fixes #13223. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13238 (cherry picked from commit `c550e681d7`)	2023-04-27 19:16:58 +03:00
Nadav Har'El	00a8c3a433	test/alternator: increase CQL connection timeout This patch increases the connection timeout in the get_cql_cluster() function in test/cql-pytest/run.py. This function is used to test that Scylla came up, and also test/alternator/run uses it to set up the authentication - which can only be done through CQL. The Python driver has 2-second and 5-second default timeouts that should have been more than enough for everybody (TM), but in #13239 we saw that in one case it apparently wasn't enough. So to be extra safe, let's increase the default connection-related timeouts to 60 seconds. Note this change only affects the Scylla boot in the test/*/run scripts, and it does not affect the actual tests - those have different code to connect to Scylla (see cql_session() in test/cql-pytest/util.py), and we already increased the timeouts there in #11289. Fixes #13239 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13291 (cherry picked from commit `4fdcee8415`)	2023-04-27 19:15:39 +03:00
Tomasz Grabiec	c08ed39a33	direct_failure_detector: Avoid throwing exceptions in the success path sleep_abortable() is aborted on success, which causes sleep_aborted exception to be thrown. This causes scylla to throw every 100ms for each pinged node. Throwing may reduce performance if happens often. Also, it spams the logs if --logger-log-level exception=trace is enabled. Avoid by swallowing the exception on cancellation. Fixes #13278. Closes #13279 (cherry picked from commit `99cb948eac`)	2023-04-27 19:14:31 +03:00
Kefu Chai	04424f8956	test: cql-pytest: test_describe: clamp bloom filter's fp rate before this change, we use `round(random.random(), 5)` for the value of `bloom_filter_fp_chance` config option. there are chances that this expression could return a number lower or equal to 6.71e-05. but we do have a minimal for this option, which is defined by `utils::bloom_calculations::probs`. and the minimal false positive rate is 6.71e-05. we are observing test failures where the we are using 0 for the option, and scylla right rejected it with the error message of ``` bloom_filter_fp_chance must be larger than 6.71e-05 and less than or equal to 1.0 (got 0) ```. so, in this change, to address the test failure, we always use a number slightly greater or equal to a number slightly greater to the minimum to ensure that the randomly picked number is in the range of supported false positive rate. Fixes #13313 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13314 (cherry picked from commit `33f4012eeb`)	2023-04-27 19:12:53 +03:00
Beni Peled	429b696bbc	release: prepare for 5.2.0 scylla-5.2.0	2023-04-27 16:26:43 +03:00
Beni Peled	a89867d8c2	release: prepare for 5.2.0-rc5 scylla-5.2.0-rc5	2023-04-25 14:37:54 +03:00
Benny Halevy	6ad94fedf3	utils: clear_gently: do not clear null unique_ptr Otherwise the null pointer is dereferenced. Add a unit test reproducing the issue and testing this fix. Fixes #13636 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `12877ad026`)	2023-04-24 17:51:01 +03:00
Anna Stuchlik	a6188d6abc	doc: document `tombstone_gc` as not experimental The tombstone_gc was documented as experimental in version 5.0. It is no longer experimental in version 5.2. This commit updates the information about the option. Closes #13469 (cherry picked from commit `a68b976c91`)	2023-04-24 11:54:06 +03:00
Botond Dénes	50095cc3a5	Merge 'db: system_keyspace: use microsecond resolution for group0_history range tombstone' from Kamil Braun in `make_group0_history_state_id_mutation`, when adding a new entry to the group 0 history table, if the parameter `gc_older_than` is engaged, we create a range tombstone in the mutation which deletes entries older than the new one by `gc_older_than`. In particular if `gc_older_than = 0`, we want to delete all older entries. There was a subtle bug there: we were using millisecond resolution when generating the tombstone, while the provided state IDs used microsecond resolution. On a super fast machine it could happen that we managed to perform two schema changes in a single millisecond; this happened sometimes in `group0_test.test_group0_history_clearing_old_entries` on our new CI/promotion machines, causing the test to fail because the tombstone didn't clear the entry correspodning to the previous schema change when performing the next schema change (since they happened in the same millisecond). Use microsecond resolution to fix that. The consecutive state IDs used in group 0 mutations are guaranteed to be strictly monotonic at microsecond resolution (see `generate_group0_state_id` in service/raft/raft_group0_client.cc). Fixes #13594 Closes #13604 * github.com:scylladb/scylladb: db: system_keyspace: use microsecond resolution for group0_history range tombstone utils: UUID_gen: accept decimicroseconds in min_time_UUID (cherry picked from commit `10c1f1dc80`)	2023-04-23 16:03:02 +03:00
Botond Dénes	7b2215d8e0	Merge 'Backport bugfixes regarding UDT, UDF, UDA interactions to branch-5.2' from Wojciech Mitros This patch backports https://github.com/scylladb/scylladb/pull/12710 to branch-5.2. To resolve the conflicts that it's causing, it also includes * https://github.com/scylladb/scylladb/pull/12680 * https://github.com/scylladb/scylladb/pull/12681 Closes #13542 * github.com:scylladb/scylladb: uda: change the UDF used in a UDA if it's replaced functions: add helper same_signature method uda: return aggregate functions as shared pointers udf: also check reducefunc to confirm that a UDF is not used in a UDA udf: fix dropping UDFs that share names with other UDFs used in UDAs pytest: add optional argument for new_function argument types udt: disallow dropping a user type used in a user function	2023-04-19 01:38:08 -04:00
Botond Dénes	da9f90362d	Merge 'Compaction reevaluation bug fixes' from Raphael "Raph" Carvalho A problem in compaction reevaluation can cause the SSTable set to be left uncompacted for indefinite amount of time, potentially causing space and read amplification to be suboptimal. Two revaluation problems are being fixed, one after off-strategy compaction ended, and another in compaction manager which intends to periodically reevaluate a need for compaction. Fixes https://github.com/scylladb/scylladb/issues/13429. Fixes https://github.com/scylladb/scylladb/issues/13430. Closes #13431 * github.com:scylladb/scylladb: compaction: Make compaction reevaluation actually periodic replica: Reevaluate regular compaction on off-strategy completion (cherry picked from commit `9a02315c6b`)	2023-04-19 01:14:33 -04:00
Botond Dénes	c9a17c80f6	mutation/mutation_compactor: consume_partition_end(): reset _stop The purpose of `_stop` is to remember whether the consumption of the last partition was interrupted or it was consumed fully. In the former case, the compactor allows retreiving the compaction state for the given partition, so that its compaction can be resumed at a later point in time. Currently, `_stop` is set to `stop_iteration::yes` whenever the return value of any of the `consume()` methods is also `stop_iteration::yes`. Meaning, if the consuming of the partition is interrupted, this is remembered in `_stop`. However, a partition whose consumption was interrupted is not always continued later. Sometimes consumption of a partitions is interrputed because the partition is not interesting and the downstream consumer wants to stop it. In these cases the compactor should not return an engagned optional from `detach_state()`, because there is not state to detach, the state should be thrown away. This was incorrectly handled so far and is fixed in this patch, but overwriting `_stop` in `consume_partition_end()` with whatever the downstream consumer returns. Meaning if they want to skip the partition, then `_stop` is reset to `stop_partition::no` and `detach_state()` will return a disengaged optional as it should in this case. Fixes: #12629 Closes #13365 (cherry picked from commit `bae62f899d`)	2023-04-18 02:32:24 -04:00

1 2 3 4 5 ...

34770 Commits