scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 05:53:13 +00:00

Author	SHA1	Message	Date
Michał Chojnowski	78d6471ce4	mutation_partition_v2: in apply_monotonically(), avoid bad_alloc on sentinel insertion apply_monotonically() is run with reclaim disabled. So with some bad luck, sentinel insertion might fail with bad_alloc even on a perfectly healthy node. We can't deal with the failure of sentinel insertion, so this will result in a crash. This patch prevents the spurious OOM by reserving some memory (1 LSA segment) and only making it available right before the critical allocations. Fixes scylladb/scylladb#19552	2024-07-08 16:08:27 +02:00
Michał Chojnowski	7b3f55a65f	logalloc: add hold_reserve mutation_partition_v2::apply_monotonically() needs to perform some allocations in a destructor, to ensure that the invariants of the data structure are restored before returning. But it is usually called with reclaiming disabled, so the allocations might fail even in a perfectly healthy node with plenty of reclaimable memory. This patch adds a mechanism which allows to reserve some LSA memory (by asking the allocator to keep it unused) and make it available for allocation right when we need to guarantee allocation success.	2024-07-08 16:08:27 +02:00
Michał Chojnowski	f784be6a7e	logalloc: generalize refill_emergency_reserve() In the next patch, we will want to do the same thing as refill_emergency_reserve() does, just with a quantity different than _emergency_reserve_max. So we split off the shareable part to a new function, and use it to implement refill_emergency_reserve().	2024-07-04 12:19:01 +02:00
Kefu Chai	405f624776	cql3: define dtor of modification_statement in .cc file before this change, we rely on the compiler to use the definition of `cql3::attributes` to generate the defaulted destructor in .cc file. but with clang-19, it insists that we should have a complete definition available for defining the defaulted destructor, otherwise it fails the build: ``` /home/kefu/.local/bin/clang++ -DFMT_SHARED -DSCYLLA_BUILD_MODE=release -DSEASTAR_API_LEVEL=7 -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SSTRING -DXXH_PRIVATE_API -DCMAKE_INTDIR=\"RelWithDebInfo\" -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/build/gen -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/seastar/gen/include -I/home/kefu/dev/scylladb/build/seastar/gen/src -isystem /home/kefu/dev/scylladb/abseil -ffunction-sections -fdata-sections -O3 -g -gz -std=gnu++23 -fvisibility=hidden -Wall -Werror -Wextra -Wno-error=deprecated-declarations -Wimplicit-fallthrough -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags -Wno-missing-field-initializers -Wno-overloaded-virtual -Wno-unsupported-friend -Wno-enum-constexpr-conversion -Wno-unused-parameter -ffile-prefix-map=/home/kefu/dev/scylladb=. 
-march=westmere -mllvm -inline-threshold=2500 -fno-slp-vectorize -U_FORTIFY_SOURCE -Werror=unused-result -MD -MT CMakeFiles/scylla-main.dir/RelWithDebInfo/table_helper.cc.o -MF CMakeFiles/scylla-main.dir/RelWithDebInfo/table_helper.cc.o.d -o CMakeFiles/scylla-main.dir/RelWithDebInfo/table_helper.cc.o -c /home/kefu/dev/scylladb/table_helper.cc In file included from /home/kefu/dev/scylladb/table_helper.cc:10: In file included from /home/kefu/dev/scylladb/seastar/include/seastar/core/coroutine.hh:25: In file included from /home/kefu/dev/scylladb/seastar/include/seastar/core/future.hh:30: In file included from /usr/lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/memory:78: /usr/lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/bits/unique_ptr.h:91:16: error: invalid application of 'sizeof' to an incomplete type 'cql3::attributes' 91 \| static_assert(sizeof(_Tp)>0, \| ^~~~~~~~~~~ /usr/lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/bits/unique_ptr.h:398:4: note: in instantiation of member function 'std::default_delete<cql3::attributes>::operator()' requested here 398 \| get_deleter()(std::move(__ptr)); \| ^ /home/kefu/dev/scylladb/cql3/statements/modification_statement.hh:40:7: note: in instantiation of member function 'std::unique_ptr<cql3::attributes>::~unique_ptr' requested here 40 \| class modification_statement : public cql_statement_opt_metadata { \| ^ /home/kefu/dev/scylladb/cql3/statements/modification_statement.hh:40:7: note: in implicit destructor for 'cql3::statements::modification_statement' first required here /home/kefu/dev/scylladb/cql3/statements/modification_statement.hh:28:7: note: forward declaration of 'cql3::attributes' 28 \| class attributes; \| ^ ``` so, in this change, we define the destructor in .cc file, where the complete definition of `cql3::attributes` is available. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19545	2024-06-30 19:35:05 +03:00
Avi Kivity	0ce00ebfbd	Merge 'Close output stream in task manager's API get_tasks handler' from Pavel Emelyanov If client stops reading response early, the server-side stream throws but must be closed anyway. Seen in another endpoint and fixed by #19541 Closes scylladb/scylladb#19542 * github.com:scylladb/scylladb: api: Fix indentation after previous patch api: Close response stream on error api: Flush response output stream before closing	2024-06-30 19:34:00 +03:00
Avi Kivity	3a85d88b68	Merge 'Close output_stream in get_snapshot_details() API handler' from Pavel Emelyanov All streams used by httpd handlers are to be closed by the handler itself, caller doesn't take care of that. fixes: #19494 Closes scylladb/scylladb#19541 * github.com:scylladb/scylladb: api: Fix indentation after previous patch api: Close output_stream on error api: Flush response output stream before closing	2024-06-30 19:33:16 +03:00
Avi Kivity	2fbc532e4d	Update tools/python3 submodule * tools/python3 3e833f1...18fa79e (1): > reloc: use `--add-rpath` and not `--set-rpath`	2024-06-30 19:31:23 +03:00
Kefu Chai	77d2d5821d	build: cmake: do not mark cqlsh noarch in `3c7af287`, cqlsh's reloc package was marked as "noarch", and its filename was updated accordingly in `configure.py`, so let's update the CMake building system accordingly. this change should address the build failure of ``` 08:48:14 [3325/4124] Generating ../Debug/dist/tar/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz 08:48:14 FAILED: Debug/dist/tar/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz /jenkins/workspace/scylla-master/scylla-ci/scylla/build/Debug/dist/tar/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz 08:48:14 cd /jenkins/workspace/scylla-master/scylla-ci/scylla/build/dist && /usr/bin/cmake -E copy /jenkins/workspace/scylla-master/scylla-ci/scylla/tools/cqlsh/build/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz /jenkins/workspace/scylla-master/scylla-ci/scylla/build/Debug/dist/tar/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz 08:48:14 Error copying file "/jenkins/workspace/scylla-master/scylla-ci/scylla/tools/cqlsh/build/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz" to "/jenkins/workspace/scylla-master/scylla-ci/scylla/build/Debug/dist/tar/scylla-cqlsh-6.1.0~dev-0.20240629.60955ead75ef.noarch.tar.gz". ``` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19544	2024-06-30 19:26:54 +03:00
Kefu Chai	947e28146d	dbuild: pass --tty when running in interactive mode podman does not allocate a tty by default, so without `-t` or `--tty`, one cannot use a functional terminal when interacting with the container. that what one can expect when running `dbuild -i --`, and we are greeted with : ``` bash: cannot set terminal process group (-1): Inappropriate ioctl for device bash: no job control in this shell ``` after this change, one can enjoy the good-old terminal as usual after being dropped to the container provided by `dbuild -i --`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19550	2024-06-30 12:06:55 +03:00
Pavel Emelyanov	d034cde01f	Merge 'build: update C++ standard to C++23' from Avi Kivity Switch the C++ standard from C++20 to C++23. This is straightforward, but there are a few fallouts (mostly due to std::unique_ptr that became constexpr) that need to be fixed first. Internal enhancement - no backport required Closes scylladb/scylladb#19528 * github.com:scylladb/scylladb: build: switch to C++23 config: avoid binding an lvalue reference to an rvalue reference readers: define query::partition_slice before using it in default argument test: define table_for_tests earlier compaction: define compaction_group::table_state earlier compaction: compaction_group: define destructor out-of-line compaction_manager: define compaction_manager::strategy_control earlier	2024-06-28 18:02:33 +03:00
Avi Kivity	cf66f233aa	build: remove aarch64 workarounds In `90a6c3bd7a` ("build: reduce release mode inline tuning on aarch64") we reduced inlining on aarch64, due to miscompiles. In `224a2877b9` ("build: disable -Og in debug mode to avoid coroutine asan breakage") we disabled optimization in debug mode, due to miscompiles. With clang 18.1, it appears the miscompiles are gone, and we can remove the two workarounds. Closes scylladb/scylladb#19531	2024-06-28 17:53:51 +03:00
Pavel Emelyanov	1be8b2fd25	api: Fix indentation after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-28 16:07:21 +03:00
Pavel Emelyanov	986a04cb11	api: Close response stream on error The handler's lambda is called with && stream object and must close the stream on its own regardless of what. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-28 16:06:41 +03:00
Pavel Emelyanov	4897d8f145	api: Flush response output stream before closing The .close() method flushes the stream, but it may throw doing it. Next patch will want .close() not to throw, for that stream must be flushed explicitly before closing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-28 16:05:20 +03:00
Pavel Emelyanov	1839030e3b	api: Fix indentation after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-28 15:41:12 +03:00
Pavel Emelyanov	a0c1552cea	api: Close output_stream on error If the get_snapshot_details() lambda throws, the output stream remains non-closed which is bad. Close it regardless of what. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-28 15:40:42 +03:00
Pavel Emelyanov	d1fd886608	api: Flush response output stream before closing Otherwise close() may throw and this is what next patch will want not to happen. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-28 15:40:00 +03:00
Piotr Dulikowski	f00c4eaf72	Merge '[test.py] add --extra-scylla-cmdline-options argument for test.py' from Artsiom Mishuta this PR has 2 commits - [test: pass Scylla extra CMD args from test.py args](`6b367a04b5`) - [test: adjust scylla_cluster.merge_cmdline_options behavior](`c60b36090a`) the main goal is to solve [test.py: provide an easy-to-remember, univeral way to run scylla with trace level logging](https://github.com/scylladb/scylladb/issues/14960) issue but also can be used to easily apply additional arguments for all UnitTests and PythonTests on the fly from the test.py CMD Closes scylladb/scylladb#19509 * github.com:scylladb/scylladb: test: adjust scylla_cluster.merge_cmdline_options behavior test: pass scylla extra CMD args from test.py args	2024-06-28 11:11:29 +02:00
Kamil Braun	6ec8143e56	Merge 'Remove dead code from migration_manager and schema_tables' from Benny Halevy This short series removed some ancient legacy code from migration_manager and schema_tables, before I make further changes in this area. We have more such code under the cql3 hierarchy but it can be dealt with as a follow up. No backport required Closes scylladb/scylladb#19530 * github.com:scylladb/scylladb: schema_tables: remove dead code migration_manager: remove dead code	2024-06-28 10:59:21 +02:00
Piotr Smaron	88eda47f13	cql: forbid switching from tablets to vnodes in ALTER KS This check is already in place, but isn't fully working, i.e. switching from a vnode KS to a tablets KS is not allowed, but this check doesn't work in the other direction. To fix the latter, `ks_prop_defs::get_initial_tablets()` has been changed to handle 3 states: (1) init_tablets is set, (2) it was skipped, (3) tablets are disabled. These couldn't fit into std::optional, so a new local struct to hold these states has been introduced. Callers of this function have been adjusted to set init_tablets to an appropriate value according to the circumstances, i.e. if tablets are globally enabled, but have been skipped in the CQL, init_tablets is automatically set to 0, but if someone executes ALTER KS and doesn't provide tablets options, they're inherited from the old KS. I tried various approaches and this one resulted in the least lines of code changed. I also provided testcases to explain how the code behaves. Fixes: #18795 Closes scylladb/scylladb#19368	2024-06-28 11:41:41 +03:00
Benny Halevy	b7f00ba4bf	schema_tables: remove dead code Well, even after 10 years, the c++ compilers still do not compile Java... And having that legacy code laying around not only it doesn't help anyone understand what's going on, but on the contrary, it's confusing and distracting. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-06-27 20:34:02 +03:00
Benny Halevy	5f6c411656	migration_manager: remove dead code Well, even after 10 years, the c++ compilers still do not compile Java... And having that legacy code laying around not only it doesn't help anyone understand what's going on, but on the contrary, it's confusing and distracting. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-06-27 20:30:33 +03:00
Avi Kivity	4d85db9f39	build: switch to C++23 Set the C++ dialect to C++23, allowing us to use the new features.	2024-06-27 19:36:13 +03:00
Avi Kivity	d14eec8160	config: avoid binding an lvalue reference to an rvalue reference config_file::add_deprecated_options() returns an lvalue reference to a parameter which itself is an rvalue reference. In C++20 this is bad practice (but not a bug in this case) as rvalue references are not expected to live past the call. In C++23, it fails to compile. Fix by accepting an lvalue reference for the parameter, and adjust the caller.	2024-06-27 19:36:13 +03:00
Avi Kivity	ed816afac4	readers: define query::partition_slice before using it in default argument C++23 made std::unique_ptr constexpr. A side effect of this (presumably) is that the compiler compiles it more eagerly, requiring the full definition of the class in std::make_unique, while it previously was content with finding the definition later. One victim of this change is the default argument of make_reversing_reader; define it earlier (by including its header) to build with C++23.	2024-06-27 19:36:13 +03:00
Piotr Dulikowski	f9abe52d3b	Merge 'test: auth: add random tag to resources in test_auth_v2_migration' from Marcin Maliszkiewicz Those tests are sometimes failing on CI and we have two hypotheses: 1. Something wrong with consistency of statements 2. Interruption from another test run (e.g. same queries performed concurrently or data remained after previous run) To exclude or confirm 2. we add random marker to avoid potential collision, in such case it will be clearly visible that wrong data comes from a different run. Related scylladb/scylladb#18931 Related scylladb/scylladb#18319 backport: no, just a test fix Closes scylladb/scylladb#19484 * github.com:scylladb/scylladb: test: auth: add random tag to resources in test_auth_v2_migration test: extend unique_name with random sufix	2024-06-27 17:35:14 +02:00
Avi Kivity	e5807555bd	test: define table_for_tests earlier C++23 made std::unique_ptr constexpr. A side effect of this (presumably) is that the compiler compiles it more eagerly, requiring the full definition of the class in std::make_unique, while it previously was content with finding the definition later. One victim of this change is table_for_tests; define it earlier to build with C++23.	2024-06-27 17:54:12 +03:00
Avi Kivity	d5ba0b4041	compaction: define compaction_group::table_state earlier C++23 made std::unique_ptr constexpr. A side effect of this (presumably) is that the compiler compiles it more eagerly, requiring the full definition of the class in std::make_unique, while it previously was content with finding the definition later. One victim of this change is compaction_group::table_state; define it earlier to build with C++23.	2024-06-27 17:54:12 +03:00
Avi Kivity	9ecf4ada49	compaction: compaction_group: define destructor out-of-line Define compaction_group::~compaction_group() out-of-line to prevent problems instantiating compaction_group::_table_state, which is an std::unique_ptr. In C++23, std::unique_ptr is constexpr, which means its methods (in this case the destructor) require seeing the definition of the class at the point of instantiation.	2024-06-27 17:54:12 +03:00
Avi Kivity	050e7bbd64	compaction_manager: define compaction_manager::strategy_control earlier C++23 made std::unique_ptr constexpr. A side effect of this (presumably) is that the compiler compiles it more eagerly, requiring the full definition of the class in std::make_unique, while it previously was content with finding the definition later. One victim of this change is compaction_manager::strategy_control; define it earlier to build with C++23.	2024-06-27 17:54:12 +03:00
Andrei Chekun	561e88f00e	[test.py] Throw meaningful error when something wrong with Scylla binary Fixes: https://github.com/scylladb/scylladb/issues/19489 There is already a check that the Scylla binary is executable, but it's done at a later stage. So in the logs for a specific test file there will be a message about something wrong with the binary, but in the console there will be no signs of that. Moreover, there will be an error that completely misleads about what actually happened and why the test run failed. With this check the test will fail earlier, providing the correct reason why it failed. Closes scylladb/scylladb#19491	2024-06-27 17:38:32 +03:00
Avi Kivity	581d619572	storage_proxy: trace speculative retries A speculative retry can appear out of the blue[1] and confuse people, as it looks like the consistency level was elevated. Fix by adding such a tracepoint. Sample output: ``` activity \| timestamp \| source \| source_elapsed \| client ---------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+-----------+----------------+----------- Execute CQL3 query \| 2024-06-27 14:25:58.947000 \| 127.0.0.1 \| 0 \| 127.0.0.1 Parsing a statement [shard 0] \| 2024-06-27 14:25:58.947918 \| 127.0.0.1 \| 2 \| 127.0.0.1 Processing a statement for authenticated user: anonymous [shard 0] \| 2024-06-27 14:25:58.948025 \| 127.0.0.1 \| 108 \| 127.0.0.1 Creating read executor for token -4069959284402364209 with all: [127.0.0.1, 127.0.0.2] targets: [127.0.0.2] repair decision: NONE [shard 0] \| 2024-06-27 14:25:58.948125 \| 127.0.0.1 \| 209 \| 127.0.0.1 Added extra target 127.0.0.1 for speculative read [shard 0] \| 2024-06-27 14:25:58.948128 \| 127.0.0.1 \| 212 \| 127.0.0.1 Creating speculating_read_executor [shard 0] \| 2024-06-27 14:25:58.948129 \| 127.0.0.1 \| 213 \| 127.0.0.1 read_data: sending a message to /127.0.0.2 [shard 0] \| 2024-06-27 14:25:58.948138 \| 127.0.0.1 \| 222 \| 127.0.0.1 Launching speculative retry for data [shard 0] \| 2024-06-27 14:25:58.948234 \| 127.0.0.1 \| 318 \| 127.0.0.1 read_data: querying locally [shard 0] \| 2024-06-27 14:25:58.948235 \| 127.0.0.1 \| 319 \| 127.0.0.1 Start querying singular range {{-4069959284402364209, pk{000400000001}}} [shard 0] \| 2024-06-27 14:25:58.948246 \| 127.0.0.1 \| 330 \| 127.0.0.1 [reader concurrency semaphore user] admitted immediately [shard 0] \| 2024-06-27 14:25:58.948250 \| 127.0.0.1 \| 334 \| 127.0.0.1 [reader concurrency semaphore user] executing read [shard 0] \| 2024-06-27 14:25:58.948258 \| 127.0.0.1 \| 342 \| 127.0.0.1 Querying cache for range 
{{-4069959284402364209, pk{000400000001}}} and slice [(-inf, +inf)] [shard 0] \| 2024-06-27 14:25:58.948281 \| 127.0.0.1 \| 365 \| 127.0.0.1 Page stats: 1 partition(s), 0 static row(s) (0 live, 0 dead), 1 clustering row(s) (1 live, 0 dead) and 0 range tombstone(s) [shard 0] \| 2024-06-27 14:25:58.948311 \| 127.0.0.1 \| 395 \| 127.0.0.1 Querying is done [shard 0] \| 2024-06-27 14:25:58.948320 \| 127.0.0.1 \| 404 \| 127.0.0.1 read_data: message received from /127.0.0.1 [shard 0] \| 2024-06-27 14:25:58.948351 \| 127.0.0.2 \| 12 \| 127.0.0.1 Done processing - preparing a result [shard 0] \| 2024-06-27 14:25:58.948354 \| 127.0.0.1 \| 438 \| 127.0.0.1 Start querying singular range {{-4069959284402364209, pk{000400000001}}} [shard 0] \| 2024-06-27 14:25:58.948370 \| 127.0.0.2 \| 31 \| 127.0.0.1 [reader concurrency semaphore user] admitted immediately [shard 0] \| 2024-06-27 14:25:58.948374 \| 127.0.0.2 \| 35 \| 127.0.0.1 [reader concurrency semaphore user] executing read [shard 0] \| 2024-06-27 14:25:58.948388 \| 127.0.0.2 \| 49 \| 127.0.0.1 Querying cache for range {{-4069959284402364209, pk{000400000001}}} and slice [(-inf, +inf)] [shard 0] \| 2024-06-27 14:25:58.948405 \| 127.0.0.2 \| 66 \| 127.0.0.1 Page stats: 1 partition(s), 0 static row(s) (0 live, 0 dead), 1 clustering row(s) (1 live, 0 dead) and 0 range tombstone(s) [shard 0] \| 2024-06-27 14:25:58.948424 \| 127.0.0.2 \| 85 \| 127.0.0.1 Querying is done [shard 0] \| 2024-06-27 14:25:58.948430 \| 127.0.0.2 \| 91 \| 127.0.0.1 read_data handling is done, sending a response to /127.0.0.1 [shard 0] \| 2024-06-27 14:25:58.948436 \| 127.0.0.2 \| 97 \| 127.0.0.1 read_data: got response from /127.0.0.2 [shard 0] \| 2024-06-27 14:25:58.949140 \| 127.0.0.1 \| 1224 \| 127.0.0.1 Request complete \| 2024-06-27 14:25:58.947449 \| 127.0.0.1 \| 449 \| 127.0.0.1 ``` Ref #18988 [1] not completely out of the blue, `ff29f430` indicates that a speculative read can happen. Closes scylladb/scylladb#19520	2024-06-27 17:37:36 +03:00
Avi Kivity	0d23b8165e	build: update frozen toolchain to Fedora 40 with clang 18.1.6 This refreshes our dependencies to a supported distribution. Closes scylladb/scylladb#19205	2024-06-27 14:27:21 +03:00
Yaron Kaikov	efa94b06c2	.github/scripts/label_promoted_commits.py: fix adding labels when PR is closed `prs = response.json().get("items", [])` will return empty when there are no merged PRs, and this will just skip the all-label replacement process. This is a regression following the work done in #19442 Adding another part to handle closed PRs (which is the majority of the cases we have in Scylla core) Fixes: https://github.com/scylladb/scylladb/issues/19441 Closes scylladb/scylladb#19497	2024-06-27 14:00:44 +03:00
Pavel Emelyanov	6c1e5c248f	main,proxy: Drain proxy in its stop_remote Currently proxy initialization is pretty dispersed, in particular it's stopped in several steps -- first drain_on_shutdown() then stop_remote(). In between there's nothing that needs proxy in any particular state, so those two steps can be merged into one. refs: scylladb/scylladb#2737 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#19344	2024-06-27 12:26:51 +02:00
Pavel Emelyanov	1a219c674c	s3/client: Always retry http requests Real S3 server is known to actively close connections, thus breaking S3 storage backend at random places. The recent http client update is more robust against that, but the needed feature is OFF by default. refs: scylladb/seastar#1883 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#19461	2024-06-27 13:14:24 +03:00
Artsiom Mishuta	919d44e0c7	test: adjust scylla_cluster.merge_cmdline_options behavior Adjust merge_cmdline_options behaviour to append the --logger-log-level option instead of merging it. This behaviour can be changed (if needed) back to the previous version (merge everything): merge_cmdline_options(list1, list2, appending_options=[]) or, to append different cmd options: merge_cmdline_options(list1, list2, appending_options=[option1,option2])	2024-06-27 10:03:31 +02:00
Artsiom Mishuta	440785bc41	test: pass scylla extra CMD args from test.py args this commit introduces a test.py option --extra-scylla-cmdline-options to pass extra scylla cmdline options for all tests. Options should be space-separated: '--logger-log-level raft=trace --default-log-level error'	2024-06-27 10:02:55 +02:00
Artsiom Mishuta	677173bf8b	test: generate core dumps on crashes in nodetool tests The nodetool tests do not set the asan/ubsan options to abort on error and create core dumps. Fix by setting the environment variables in nodetool tests. Closes scylladb/scylladb#19503	2024-06-27 10:44:33 +03:00
Marcin Maliszkiewicz	b708c5701f	test: auth: add random tag to resources in test_auth_v2_migration Those tests are sometimes failing on CI and we have two hypotheses: 1. Something wrong with consistency of statements 2. Interruption from another test run (e.g. same queries performed concurrently or data remained after previous run) To exclude or confirm 2. we add random marker to avoid potential collision, in such case it will be clearly visible that wrong data comes from a different run. Related scylladb/scylladb#18931 Related scylladb/scylladb#18319	2024-06-27 09:28:27 +02:00
Marcin Maliszkiewicz	d08a80b34f	test: extend unique_name with random sufix This reduces collision risk in an unlikely and incorrect setup where tests would be run concurrently by multiple processes.	2024-06-27 09:28:02 +02:00
Anna Stuchlik	e2994a19d5	doc: update Scylla Doctor installation This commit updates the instructions on how to download and run Scylla Doctor, following the changes in how Scylla Doctor is released. Closes scylladb/scylladb#19510	2024-06-27 10:22:08 +03:00
Botond Dénes	2fe50cda22	Merge 'chunked_vector enhancements' from Benny Halevy This short series enhances utils::chunked_vector so it could be used more easily to convert dht::partition_range_vector to chunked_vector, for example. - utils: chunked_vector: document invalidation of iterators on move - utils: chunked_vector: add ctor from std::initializer_list - utils: chunked_vector: add ctor from a single value No backport required Closes scylladb/scylladb#19462 * github.com:scylladb/scylladb: chunked_vector_test: add tests for value-initialization constructor utils: chunked_vector: add ctor from std::initializer_list utils: chunked_vector: document invalidation of iterators on move	2024-06-27 10:20:47 +03:00
Anna Stuchlik	072542a5cc	doc: add a page with ScyllaDB limits This commit adds a page listing the ScyllaDB limits we know today. The page can and should be extended when other limits are confirmed. Closes scylladb/scylladb#19399	2024-06-27 08:28:51 +03:00
Kefu Chai	52f1168a3d	repair: remove unused operator<< since we've switched almost all callers of the operator<< to {fmt}, let's drop the unused operator<<:s. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19508	2024-06-26 21:57:03 +03:00
Israel Fruchter	3c7af28725	cqlsh: update cqlsh submodule this change updates the cqlsh submodule: * tools/cqlsh/ ba83aea3...73bdbeb0 (4): > install.sh: replace tab with spaces > define the the debug packge is empty > tests: switch from using cqlsh bash to the test the python file > package python driver as wheels it also includes follow change to package cqlsh as a regular rpm instead of as a "noarch" rpm: so far cqlsh bundles the python-driver in, but only as source. meaning the package wasn't architecture, and also didn't have the libev eventloop compiled in. Since from python 3.12 and up, that would mean we would fallback into asyncio eventloop (which still exprimental) or into error (once we'll sync with the driver upstream) so to avoid those, we are change the packaging of cqlsh to be architecture specific, and get cqlsh compiled, and bundle all of it's requirements as per architecture installed bundle of wheels. using `shiv`, i.e. one file virtualenv that we'll be packing into our artifacts Ref: https://github.com/scylladb/scylla-cqlsh/issues/90 Ref: https://github.com/scylladb/scylla-cqlsh/pull/91 Ref: https://github.com/linkedin/shiv Closes scylladb/scylladb#19385 * tools/cqlsh ba83aea...242876c (1): > Merge 'package python driver as wheels' from Israel Fruchter Update tools/cqlsh/ submodule in which, the change of `define the the debug packge is empty` should address the build failure like ``` Processing files: scylla-cqlsh-debugsource-6.1.0~dev-0.20240624.c7748f60c0bc.aarch64 error: Empty %files file /jenkins/workspace/scylla-master/next/scylla/tools/cqlsh/build/redhat/BUILD/scylla-cqlsh/debugsourcefiles.list RPM build errors: Empty %files file /jenkins/workspace/scylla-master/next/scylla/tools/cqlsh/build/redhat/BUILD/scylla-cqlsh/debugsourcefiles.list ``` Closes scylladb/scylladb#19473	2024-06-26 12:07:21 +03:00
Pavel Emelyanov	263668bc85	transport: Use sharded<>::invoke_on_others() When preparing statement, the server code first does it on non-local shards, then on local one. The former call is done the hard way, while there's a short sugar sharded<> class method doing it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#19485	2024-06-25 22:17:59 +03:00
Kamil Braun	13fc2bd854	Merge `notify other nodes on boot` from Gleb The series adds a step during node's boot process, just before completing the initialization, in which the node sends a notification to all other normal nodes in the cluster that it is UP now. Other nodes wait for this node to be UP and in normal state before replying. This ensures that, in a healthy cluster, when a node starts serving queries the entire cluster knows its up-to-date state. The notification is a best effort though. If some nodes are down or do not reply in time the boot process continues. It is somewhat similar to shutdown notification in this regard. * 'gleb/notify-up-v2' of github.com:scylladb/scylla-dev: gossiper: wait for a bootstrapping node to be seen as normal on all nodes before completing initialization Wait for booting node to be marked UP before complete booting. gossiper: move gossip verbs to the idl	2024-06-25 17:58:17 +02:00
Aleksandra Martyniuk	2394e3ee7a	repair: drop timeout from table_sync_and_check Delete 10s timeout from read barrier in table_sync_and_check, so that the function always considers all previous group0 changes. Fixes: #18490. Closes scylladb/scylladb#18752	2024-06-25 17:44:31 +02:00
Avi Kivity	c80dc57156	Merge 'batchlog replay: bypass tombstones generated by past replays' from Botond Dénes The `system.batchlog` table has a partition for each batch that failed to complete. After finally applying the batch, the partition is deleted. Although the table has gc_grace_seconds = 0, tombstones can still accumulate in memory, because we don't purge partition tombstones from either the memtable or the cache. This can lead to the cache and memtable of this table accumulating many thousands or even millions of tombstones, making batchlog replay very slow. We didn't notice this before, because we would only replay all failed batches on unbootstrap, which is rare and a heavy and slow operation in its own right already. With repair-based tombstone-gc however, we do a full batchlog replay at the beginning of each repair, and now this extra delay is noticeable. Fix this by making sure batchlog replays don't have to scan through all the tombstones generated by previous replays: * flush the `system.batchlog` memtable at the end of each batchlog replay, so it is cleared of tombstones * bypass the cache Fixes: https://github.com/scylladb/scylladb/issues/19376 Although this is not a regression -- replay was like this since forever -- now that repair calls into batchlog replay, every release which uses repair-based tombstone-gc should get this fix Closes scylladb/scylladb#19377 * github.com:scylladb/scylladb: db/batchlog_manager: bypass cache when scanning batchlog table db/batchlog_manager: replace open-coded paging with internal one db/batchlog_manager: implement cleanup after all batchlog replay cql3/query_processor: for_each_cql_result(): move func to the coro frame	2024-06-25 16:11:01 +03:00
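
The append-versus-merge behaviour described for `merge_cmdline_options(list1, list2, appending_options=[...])` in 919d44e0c7, together with the space-separated `--extra-scylla-cmdline-options` string from 440785bc41, can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the code in scylla_cluster: the helper `split_options`, the tuple default for `appending_options`, and the restriction to the `--name value` form (no `--name=value`) are all hypothetical.

```python
import shlex


def split_options(args):
    """Group a flat argv-style list into [option, value, ...] chunks.

    Hypothetical helper: assumes every option starts with "--" and its
    values follow as separate tokens.
    """
    groups = []
    for token in args:
        if token.startswith("--") or not groups:
            groups.append([token])
        else:
            groups[-1].append(token)
    return groups


def merge_cmdline_options(base, override,
                          appending_options=("--logger-log-level",)):
    """Merge two option lists (a sketch, not the real implementation).

    Options named in appending_options are kept from both lists, so they
    may repeat. Every other option appears once, and the value from
    `override` replaces the value from `base`.
    """
    merged = []
    positions = {}  # option name -> index in merged (non-appending only)
    for group in split_options(base) + split_options(override):
        name = group[0]
        if name in appending_options:
            merged.append(group)
        elif name in positions:
            merged[positions[name]] = group
        else:
            positions[name] = len(merged)
            merged.append(group)
    return [token for group in merged for token in group]


base = ["--smp", "2", "--logger-log-level", "raft=debug",
        "--default-log-level", "info"]
# The extra options arrive as one space-separated string.
extra = shlex.split("--logger-log-level raft=trace --default-log-level error")

appended = merge_cmdline_options(base, extra)
# ['--smp', '2', '--logger-log-level', 'raft=debug',
#  '--default-log-level', 'error', '--logger-log-level', 'raft=trace']

all_merged = merge_cmdline_options(base, extra, appending_options=[])
# ['--smp', '2', '--logger-log-level', 'raft=trace',
#  '--default-log-level', 'error']
```

With the default, both `--logger-log-level` entries survive, so per-logger levels from the test and from the command line accumulate. Passing `appending_options=[]` makes the later value win for every option, as the commit message describes for the previous behaviour.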


43385 Commits