scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 03:20:37 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	a59096aa70	sstables, database: Move object storage config maintenance onto storage_manager Right now the map<endpoint, config> sits on the sstables manager and its update is governed by database (because it's peering and can kick other shards to update it as well). Having the sharded<storage_manager> at hand lets freeing database from the need to update configs and keeps sstables_manager a bit smaller. Also this will allow keeping s3 clients shared between sstables via this map by next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-11 19:39:00 +03:00
Pavel Emelyanov	2153751d45	sstables: Introduce sharded<storage_manager> The manager in question keeps track of whatever sstables_manager needs to work with the storage (spoiler: only S3 one). It's main-local sharded peering service, so that container() call can be used by next patches. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-11 19:36:01 +03:00
Avi Kivity	550aa01242	Merge 'Restore raft::internal::tagged_uint64 type' from Benny Halevy Change `f5f566bdd8` introduced tagged_integer and replaced raft::internal::tagged_uint64 with utils::tagged_integer. However, the idl type for raft::internal::tagged_uint64 was not marked as final, but utils::tagged_integer is, breaking the on-the-wire compatibility. This change restores the use of raft::internal::tagged_uint64 for the raft types and adds back an idl definition for it that is not marked as final, similar to the way raft::internal::tagged_id extends utils::tagged_uuid. Fixes #13752 Closes #13774 * github.com:scylladb/scylladb: raft, idl: restore internal::tagged_uint64 type raft: define term_t as a tagged uint64_t idl: gossip_digest: include required headers	2023-05-09 22:51:25 +03:00
Kefu Chai	d8cd62b91a	compaction/compaction: initialize local variable the initial `validation_errors` should be zero. so let's initialize it instead of leaving it to uninitialized. this should address following warning from Clang-16: ``` /usr/bin/clang++ -DDEBUG -DDEBUG_LSA_SANITIZER -DFMT_DEPRECATED_OSTREAM -DFMT_SHARED -DSANITIZE -DSCYLLA_BUILD_MODE=debug -DSCYLLA_ENABLE_ERROR_INJECTION -DSEASTAR_API_LEVEL=6 -DSEASTAR_DEBUG -DSEASTAR_DEBUG_SHARED_PTR -DSEASTAR_DEFAULT_ALLOCATOR -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SHUFFLE_TASK_QUEUE -DSEASTAR_TYPE_ERASE_MORE -DXXH_PRIVATE_API -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/cmake/seastar/gen/include -I/home/kefu/dev/scylladb/build/cmake/gen -isystem /home/kefu/dev/scylladb/build/cmake/rust -Wall -Werror -Wno-error=deprecated-declarations -Wno-c++11-narrowing -Wno-mismatched-tags -Wno-overloaded-virtual -Wno-unsupported-friend -march=westmere -Og -g -gz -std=gnu++20 -fvisibility=hidden -U_FORTIFY_SOURCE -DSEASTAR_SSTRING -Wno-error=unused-result "-Wno-error=#warnings" -fstack-clash-protection -fsanitize=address -fsanitize=undefined -fno-sanitize=vptr -MD -MT compaction/CMakeFiles/compaction.dir/compaction.cc.o -MF compaction/CMakeFiles/compaction.dir/compaction.cc.o.d -o compaction/CMakeFiles/compaction.dir/compaction.cc.o -c /home/kefu/dev/scylladb/compaction/compaction.cc /home/kefu/dev/scylladb/compaction/compaction.cc:1681:9: error: variable 'validation_errors' is uninitialized when used here [-Werror,-Wuninitialized] validation_errors += co_await sst->validate(permit, descriptor.io_priority, cdata.abort, [&schema] (sstring what) { ^~~~~~~~~~~~~~~~~ /home/kefu/dev/scylladb/compaction/compaction.cc:1676:31: note: initialize the variable 'validation_errors' to silence this warning uint64_t validation_errors; ^ = 0 ``` the change which introduced this local variable was `7ba5c9cc6a`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13813	2023-05-09 22:49:29 +03:00
Avi Kivity	8c6229d229	Merge 'sstable: encode value using UUID' from Kefu Chai in this series, we encode the value of generation using UUID to prepare for the UUID generation identifier. simpler this way, as we don't need to have two ways to encode integer or a timeduuid: uuid with a zero timestamp, and a variant. also, add a `from_string()` factory method to convert string to generation to hide the the underlying type of value from generation_type's users. Closes #13782 * github.com:scylladb/scylladb: sstable: use generation_type::from_string() to convert from string sstable: encode int using UUID in generation_type	2023-05-09 22:07:23 +03:00
Avi Kivity	996f717dfc	Merge 'cql3/prepare_expr: force token() receiver name to be partition key token' from Jan Ciołek Let's say that we have a prepared statement with a token restriction: ```cql SELECT * FROM some_table WHERE token(p1, p2) = ? ``` After calling `prepare` the drivers receives some information about the prepared statment, including names of values bound to each bind marker. In case of a partition token restriction (`token(p1, p2) = ?`) there's an expectation that the name assigned to this bind marker will be `"partition key token"`. In a recent change the code handling `token()` expressions has been unified with the code that handles generic function calls, and as a result the name has changed to `token(p1, p2)`. It turns out that the Java driver relies on the name being `"partition key token"`, so a change to `token(p1, p2)` broke some things. This patch sets the name back to `"partition key token"`. To achieve this we detect any restrictions that match the pattern `token(p1, p2, p3) = X` and set the receiver name for X to `"partition key token"`. Fixes: #13769 Closes #13815 * github.com:scylladb/scylladb: cql-pytest: test that bind marker is partition key token cql3/prepare_expr: force token() receiver name to be partition key token	2023-05-09 20:44:46 +03:00
Anna Stuchlik	c64109d8c7	doc: add driver support for Serverless Fixes https://github.com/scylladb/scylladb/issues/13453 This is V2 of https://github.com/scylladb/scylladb/pull/13710/. This commit adds: - the information about which ScyllaDB drivers support ScyllaDB Cloud Serverless. - language and organization improvements to the ScyllaDB CQL Drivers page. Closes #13825	2023-05-09 20:43:22 +03:00
Kefu Chai	c872ade50f	sstable: use generation_type::from_string() to convert from string in this change, * instead of using "\d+" to match the generation, use "[^-]", * let generation_type to convert a string to generation before this change, we casts the matched string in SSTable file name to integer and then construct a generation identifier from the integer. this solution has a strong assumption that the generation is represented with an integer, we should not encode this assumption in sstable.cc, instead we'd better let generation_type itself to take care of this. also, to relax the restriction of regex for matching generation, let's just use any characters except for the delimeter -- "-". Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 22:57:39 +08:00
Kefu Chai	478c13d0d4	sstable: encode int using UUID in generation_type since we already use UUID for encoding an bigint in SSTable registry table, let's just use the same approach for encoding bigint in generation_type, to be more consistent, and less repeatings this way. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 22:57:38 +08:00
Botond Dénes	287ccce1cc	Merge 'sstables: extract storage out ' from Kefu Chai this change extracts the storage class and its derived classes out into their own source files. for couple reasons: - for better readability. the sstables.hh is over 1005 lines. and sstables.cc 3602 lines. it's a little bit difficult to figure out how the different parts in these sources interact with each other. for instance, with this change, it's clear some of helper functions are only used by file_system_storage. - probably less inter-source dependency. by extracting the sources files out, they can be compiled individually, so changing one .cc file does not impact others. this could speed up the compilation time. Closes #13785 * github.com:scylladb/scylladb: sstables: storage: coroutinize idempotent_link_file() sstables: extract storage out	2023-05-09 14:03:40 +03:00
Jan Ciolek	9ad1c5d9f2	cql-pytest: test that bind marker is partition key token When preparing a query each bind marker gets a name. For a query like: ```cql SELECT * FROM some_table WHERE token(p1, p2) = ? ``` The bind marker's name should be `"partition key token"`. Java driver relies on this name, having something else, like `"token(p1, p2)"` be the name breaks the Java driver. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-05-09 12:33:06 +02:00
Jan Ciolek	8a256f63db	cql3/prepare_expr: force token() receiver name to be partition key token Let's say that we have a prepared statement with a token restriction: ```cql SELECT * FROM some_table WHERE token(p1, p2) = ? ``` After calling `prepare` the drivers receives some information about the prepared statment, including names of values bound to each bind marker. In case of a partition token restriction (`token(p1, p2) = ?`) there's an expectation that the name assigned to this bind marker will be `"partition key token"`. In a recent change the code handling `token()` expressions has been unified with the code that handles generic function calls, and as a result the name has changed to `token(p1, p2)`. It turns out that the Java driver relies on the name being `"partition key token"`, so a change to `token(p1, p2)` broke some things. This patch sets the name back to `"partition key token"`. To achieve this we detect any restrictions that match the pattern `token(p1, p2, p3) = X` and set the receiver name for X to `"partition key token"`. Fixes: #13769 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-05-09 12:32:57 +02:00
Benny Halevy	adfb79ba3e	raft, idl: restore internal::tagged_uint64 type Change `f5f566bdd8` introduced tagged_integer and replaced raft::internal::tagged_uint64 with utils::tagged_integer. However, the idl type for raft::internal::tagged_uint64 was not marked as final, but utils::tagged_integer is, breaking the on-the-wire compatibility. This change defines the different raft tagged_uint64 types in idl/raft_storage.idl.hh as non-final to restore the way they were serialized prior to `f5f566bdd8` Fixes #13752 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-09 12:38:20 +03:00
Kamil Braun	41cac23aa4	Merge 'raft: verify RPC destination ID' from Mikołaj Grzebieluch All Raft verbs include `dst_id`, the ID of the destination server, but it isn't checked. `append_entries` will work even if it arrives at completely the wrong server (but in the same group). It can cause problems, e.g. in the scenario of replacing a dead node. This commit adds verifying if `dst_id` matches the server's ID and if it doesn't, the Raft verb is rejected. Closes #12179 Testing --- Testcase and scylla's configuration: `57d3ef14d8` It artificially lengthens the duration of replacing the old node. It increases the chance of getting the RPC command sent to a replaced node, by the new node. In the logs of the node that replaced the old one, we can see logs in the form: ``` DEBUG <time> [shard 0] raft_group_registry - Got message for server <dst_id>, but my id is <my_id> ``` It indicates that the Raft verb with the wrong `dst_id` was rejected. This test isn't included in the PR because it doesn't catch any specific error. Closes #13575 * github.com:scylladb/scylladb: service/raft: raft_group_registry: Add verification of destination ID service/raft: raft_group_registry: `handle_raft_rpc` refactor	2023-05-09 11:33:28 +02:00
Kefu Chai	a69282e69b	sstables: storage: coroutinize idempotent_link_file() Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 16:47:00 +08:00
Kefu Chai	2eefcb37eb	sstables: extract storage out this change extracts the storage class and its derived classes out into storage.cc and storage.hh. for couple reasons: - for better readability. the sstables.hh is over 1005 lines. and sstables.cc 3602 lines. it's a little bit difficult to figure out how the different parts in these sources interact with each other. for instance, with this change, it's clear some of helper functions are only used by file_system_storage. - probably less inter-source dependency. by extracting the sources files out, they can be compiled individually, so changing one .cc file does not impact others. this could speed up the compilation time. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 16:47:00 +08:00
Botond Dénes	20f620feb9	Merge 'replica, sstable: replace generation_type::value() with generation_type::as_int()' from Kefu Chai this series prepares for the UUID based generation by replacing the general `value()` function with the function with more specific name: `as_int()`. Closes #13796 * github.com:scylladb/scylladb: test: drop a reusable_sst() variant which accepts int as generation treewide: replace generation_type::value() with generation_type::as_int()	2023-05-09 07:30:54 +03:00
Benny Halevy	531ac63a8d	raft: define term_t as a tagged uint64_t It was defined as a tagged (signed) int64_t by mistake in `f5f566bdd8`. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-09 06:51:26 +03:00
Benny Halevy	d3a59fdefd	idl: gossip_digest: include required headers To be self-sufficient, before the next patch that will affect tagged_integer. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-09 06:51:26 +03:00
Michał Chojnowski	0813fa1da0	database: fix reads_memory_consumption for system semaphore The metric shows the opposite of what its name suggests. It shows available memory rather than consumed memory. Fix that. Fixes #13810 Closes #13811	2023-05-09 06:42:43 +03:00
Nadav Har'El	5f37d43ee6	Merge 'compaction: validate: validate the index too' from Botond Dénes In addition to the data file itself. Currently validation avoids the index altogether, using the crawling reader which only relies on the data file and ignores the index+summary. This is because a corrupt sstable usually has a corrupt index too and using both at the same time might hide the corruption. This patch adds targeted validation of the index, independent of and in addition to the already existing data validation: it validates the order of index entries as well as whether the entry points to a complete partition in the data file. This will usually result in duplicate errors for out-of-order partitions: one for the data file and one for the index file. Fixes: #9611 Closes #11405 * github.com:scylladb/scylladb: test/cql-pytest: add test_sstable_validation.py test/cql-pytest: extract scylla_path,temp_workdir fixtures to conftest.py tools/scylla-sstables: write validation result to stdout sstables/sstable: validate(): delegate to mx validator for mx sstables sstables/mx/reader: add mx specific validator mutation/mutation_fragment_stream_validator: add validator() accessor to validating filter sstables/mx/reader: template data_consume_rows_context_m on the consumer sstables/mx/reader: move row_processing_result to namespace scope sstables/mx/reader: use data_consumer::proceed directly sstables/mx/reader.cc: extend namespace to end-of-file (cosmetic) compaction/compaction: remove now unused scrub_validate_mode_validate_reader() compaction/compaction: move away from scrub_validate_mode_validate_reader() tools/scylla-sstable: move away from scrub_validate_mode_validate_reader() test/boost/sstable_compaction_test: move away from scrub_validate_mode_validate_reader() sstables/sstable: add validate() method compaction/compaction: scrub_sstables_validate_mode(): validate sstables one-by-one compaction: scrub: use error messages from validator mutation_fragment_stream_validator: produce error messages in low-level validator	2023-05-08 17:14:26 +03:00
Botond Dénes	b790f14456	reader_concurrency_semaphore: execution_loop(): trigger admission check when _ready_list is empty The execution loop consumes permits from the _ready_list and executes them. The _ready_list usually contains a single permit. When the _ready_list is not empty, new permits are queued until it becomes empty. The execution loops relies on admission checks triggered by the read releasing resouces, to bring in any queued read into the _ready_list, while it is executing the current read. But in some cases the current read might not free any resorces and thus fail to trigger an admission check and the currently queued permits will sit in the queue until another source triggers an admission check. I don't yet know how this situation can occur, if at all, but it is reproducible with a simple unit test, so it is best to cover this corner-case in the off-chance it happens in the wild. Add an explicit admission check to the execution loop, after the _ready_list is exhausted, to make sure any waiters that can be admitted with an empty _ready_list are admitted immediately and execution continues. Fixes: #13540 Closes #13541	2023-05-08 17:11:41 +03:00
Takuya ASADA	fdceda20cc	scylla_raid_setup: wipe filesystem signatures from specified disks The discussion on the thread says, when we reformat a volume with another filesystem, kernel and libblkid may skip to populate /dev/disk/by-* since it detected two filesystem signatures, because mkfs.xxx did not cleared previous filesystem signature. To avoid this, we need to run wipefs before running mkfs. Note that this runs wipefs twice, for target disks and also for RAID device. wipefs for RAID device is needed since wipefs on disks doesn't clear filesystem signatures on /dev/mdX (we may see previous filesystem signature on /dev/mdX when we construct RAID volume multiple time on same disks). Also dropped -f option from mkfs.xfs, it will check wipefs is working as we expected. Fixes #13737 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Closes #13738	2023-05-08 16:53:43 +03:00
Anna Stuchlik	98e1d7a692	doc: add the Elixir driver to the docs This commit adds the link to the Exlixir driver to the list of the third-party drivers. The driver actively supports ScyllaDB. This is v2 of https://github.com/scylladb/scylladb/pull/13701 Closes #13806	2023-05-08 15:36:35 +03:00
Kamil Braun	153cb00e9d	test: test_random_tables: wait for token ring convergence before data queries The test performs an `INSERT` followed by a `SELECT`, checking if the previously inserted data is returned. This may fail because we're using `ring_delay = 0` in tests and the two queries may arrive at different nodes, whose `token_metadata` didn't converge yet (it's eventually consistent based on gossiping). I illustrated this here: https://github.com/scylladb/scylladb/issues/12937#issuecomment-1536147455 Ensure that the nodes' token rings are synchronized (by waiting until the token ring members on each node is the same as group 0 configuration). Fixes #12937 Closes #13791	2023-05-08 13:22:52 +02:00
Kamil Braun	3f3dcf451b	test: pylib: random_tables: perform read barrier in `verify_schema` `RandomTables.verify_schema` is often called in topology tests after performing a schema change. It compares the schema tables fetched from some node to the expected latest schema stored by the `RandomTables` object. However there's no guarantee that the latest schema change has already propagated to the node which we query. We could have performed the schema change on a different node and the change may not have been applied yet on all nodes. To fix that, pick a specific node and perform a read barrier on it, then use that node to fetch the schema tables. Fixes #13788 Closes #13789	2023-05-08 13:21:10 +02:00
Avi Kivity	198738f2b1	Merge 'build: compile wasm udfs automatically' from Wojciech Mitros Currently, when we deal with a Wasm program, we store it in its final WebAssembly Text form. This causes a lot of code bloat and is hard to read. Instead, we would like to store only the source codes, and build Wasm when necessary. This series adds build commands that compile C/Rust sources to Wasm and uses them for Wasm programs that we're already using. After these changes, adding a new program that should be compiled to Rust, requires only adding the source code of it and updating the `wasms` and `wasm_deps` lists in `configure.py`. All Wasm programs are build by default when building all artifacts, artifacts in a given mode, or when building tests. Additionally, a {mode}-wasm target is added, so that it's possible to build just the wasm files. The generated files are saved in $builddir/{mode}/wasm, and are accessed in cql-pytests similarly to the way we're accessing the scylla binary - using glob. Closes #13209 * github.com:scylladb/scylladb: wasm: replace wasm programs with their source programs build: prepare rules for compiling wasm files build: set the type of build_artifacts test: extend capabilities of Wasm reading helper funciton	2023-05-08 13:51:53 +03:00
Wojciech Mitros	6d89d718d9	wasm: replace wasm programs with their source programs After recent changes, we are able to store only the C/Rust source codes for Wasm programs, and only build them when neccessary. This patch utilizes this opportunity by removing most of the currently stored raw Wasm programs, replacing them with C/Rust sources and adding them to the new build system.	2023-05-08 10:47:34 +02:00
Wojciech Mitros	c065ae0ded	build: prepare rules for compiling wasm files Currently, when we deal with a Wasm program, we store it in its final WebAssembly Text form. This causes a lot of code bloat and is hard to read. Instead, we would like to store only the (C/Rust) source codes, and build Wasm when neccessary. This patch adds build commands that compile C/Rust sources to Wasm. After these changes, adding a new program that should be compiled to Rust, requires only adding the source code of it and updating the wasms and wasm_deps lists in configure.py. All Wasm programs are build by default when building all artifacts, all artifacts in a given mode, or when building tests. Additionally, a ninja wasm target is added, so that it's possible to build just the wasm files. The generated files are saved in $builddir/wasm.	2023-05-08 10:47:34 +02:00
Wojciech Mitros	c53d68ee3e	build: set the type of build_artifacts Currently, build_artifacts are of type set[str] \| list, which prevents us from performing set operations on it. In a future patch, we will want to take a set difference and set intersections with it, so we initialize the type of build_artifacts to a set in all cases.	2023-05-08 10:47:34 +02:00
Wojciech Mitros	0a34a54c73	test: extend capabilities of Wasm reading helper funciton Currently, we require that the Wasm file is named the same as the funciton. In the future we may want multiple functions with the same name, which we can't currently do due to this limitation. This patch allows specifying the function name, so that multiple files can have a function with the same name. Additionally, the helper method now escapes "'" characters, so that they can appear in future Wasm files.	2023-05-08 10:47:34 +02:00
Botond Dénes	ab5fd0f750	Merge 's3: Provide timestamps in the s3 file implementation' from Raphael "Raph" Carvalho SSTable relies on st.st_mtime for providing creation time of data file, which in turn is used by features like tombstone compaction. Therefore, let's implement it. Fixes https://github.com/scylladb/scylladb/issues/13649. Closes #13713 * github.com:scylladb/scylladb: s3: Provide timestamps in the s3 file implementation s3: Introduce get_object_stats() s3: introduce get_object_header()	2023-05-08 11:43:41 +03:00
Raphael S. Carvalho	ad471e5846	s3: Provide timestamps in the s3 file implementation SSTable relies on st.st_mtime for providing creation time of data file, which in turn is used by features like tombstone compaction. Fixes #13649. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-07 19:51:12 -03:00
Raphael S. Carvalho	57661f0392	s3: Introduce get_object_stats() get_object_stats() will be used for retrieving content size and also last modified. The latter is required for filling st_mtim, etc, in the s3::client::readable_file::stat() method. Refs #13649. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-07 19:51:10 -03:00
Raphael S. Carvalho	da2ccc44a4	s3: introduce get_object_header() This allows other functions to reuse the code to retrieve the object header. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-07 19:49:52 -03:00
Kefu Chai	5fa459bd1a	treewide: do not include unused header since #13452, we switched most of the caller sites from std::regex to boost::regex. in this change, all occurences of `#include <regex>` are dropped unless std::regex is used in the same source file. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13765	2023-05-07 19:01:29 +03:00
Kefu Chai	468460718a	utils: UUID: drop uint64_t_tri_compare() functinoality wise, `uint64_t_tri_compare()` is identical to the three-way comparison operator, so no need to keep it. in this change, it is dropped in favor of <=>. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13794	2023-05-07 18:07:49 +03:00
Tomasz Grabiec	d8826acaa3	tablets: Fix stack smashing in tablet_map_to_mutation() The code was incorrectly passing a data_value of type bytes due to implicit conversion of the result of serialize() (bytes_opt) to a data_value object of type bytes_type via: data_value(std::optional<NativeType>); mutation::set_static_cell() accepts a data_value object, which is then serialized using column's type in abstract_type::decompose(data_value&): bytes b(bytes::initialized_later(), serialized_size(*this, value._value)); auto i = b.begin(); value.serialize(i); Notice that serialized_size() is taken from the column type, but serialization is done using data_value's type. The two types may have a compatible CQL binary representation, but may differ in native types. serialized_size() may incorrectly interpret the native type and come up with the wrong size. If the size is too smaller, we end up with stack or heap corruption later after serialize(). For example, if the column type is utf8 but value holds bytes, the size will be wrong because even though both use the basic_sstring type, they have a different layout due to max_size (15 vs 31). Fixes #13717 Closes #13787	2023-05-07 14:07:50 +03:00
Botond Dénes	c1e8e86637	reader_concurrency_semaphore: reader_permit: clean-up after failed memory requests When requesting memory via `reader_permit::request_memory()`, the requested amount is added to `_requested_memory` member of the permit impl. This is because multiple concurrent requests may be blocked and waiting at the same time. When the requests are fulfilled, the entire amount is consumed and individual requests track their requested amount with `resource_units` to release later. There is a corner-case related to this: if a reader permit is registered as inactive while it is waiting for memory, its active requests are killed with `std::bad_alloc`, but the `_requested_memory` fields is not cleared. If the read survives because the killed requests were part of a non-vital background read-ahead, a later memory request will also include amount from the failed requests. This extra amount wil not be released and hence will cause a resource leak when the permit is destroyed. Fix by detecting this corner case and clearing the `_requested_memory` field. Modify the existing unit test for the scenario of a permit waiting on memory being registered as inactive, to also cover this corner case, reproducing the bug. Fixes: #13539 Closes #13679	2023-05-07 14:06:51 +03:00
Kefu Chai	bd3e8d0460	test: drop a reusable_sst() variant which accepts int as generation this is one of the changes to reduce the usage of integer based generation test. in future, we will need to expand the test to exercise the UUID based generation, or at least to be neutral to the underlying generation's identifier type. so, to remove the helpers which only accept `generation_type::int_t` would helps us to make this happen. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-06 18:24:48 +08:00
Kefu Chai	9b35faf485	treewide: replace generation_type::value() with generation_type::as_int() * replace generation_type::value() with generation_type::as_int() * drop generation_value() because we will switch over to UUID based generation identifier, the member function or the free function generation_value() cannot fulfill the needs anymore. so, in this change, they are consolidated and are replaced by "as_int()", whose name is more specific, and will also work and won't be misleading even after switching to UUID based generation identifier. as `value()` would be confusing by then: it could be an integer or a UUID. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-06 18:24:45 +08:00
Kamil Braun	aba31ad06c	storage_service: use `seastar::format` instead of `fmt::format` For some reason Scylla crashes on `aarch64` in release mode when calling `fmt::format` in `raft_removenode` and `raft_decommission`. E.g. on this line: ``` group0_command g0_cmd = _group0->client().prepare_command(std::move(change), guard, fmt::format("decomission: request decomission for {}", raft_server.id())); ``` I found this in our configure.py: ``` def get_clang_inline_threshold(): if args.clang_inline_threshold != -1: return args.clang_inline_threshold elif platform.machine() == 'aarch64': # we see miscompiles with 1200 and above with format("{}", uuid) # also coroutine miscompiles with 600 return 300 else: return 2500 ``` but reducing it to `0` didn't help. I managed to get the following backtrace (with inline threshold 0): ``` void boost::intrusive::list_impl<boost::intrusive::mhtraits<seastar::thread_context, boost::intrusive::list_member_hook<>, &seastar::thread_context::_all_link>, unsigned long, false, void>::clear_and_dispose<boost::intrusive::detail::null_disposer>(boost::intrusive::detail::null_disposer) at /usr/include/boost/intrusive/list.hpp:751 (inlined by) boost::intrusive::list_impl<boost::intrusive::mhtraits<seastar::thread_context, boost::intrusive::list_member_hook<>, &seastar::thread_context::_all_link>, unsigned long, false, void>::clear() at /usr/include/boost/intrusive/list.hpp:728 (inlined by) ~list_impl at /usr/include/boost/intrusive/list.hpp:255 void fmt::v9::detail::buffer<wchar_t>::append<wchar_t>(wchar_t const, wchar_t const) at ??:? void fmt::v9::detail::vformat_to<char>(fmt::v9::detail::buffer<char>&, fmt::v9::basic_string_view<char>, fmt::v9::basic_format_args<fmt::v9::basic_format_context<std::conditional<std::is_same<fmt::v9::type_identity<char>::type, char>::value, fmt::v9::appender, std::back_insert_iterator<fmt::v9::detail::buffer<fmt::v9::type_identity<char>::type> > >::type, fmt::v9::type_identity<char>::type> >, fmt::v9::detail::locale_ref) at ??:? fmt::v9::vformat[abi:cxx11](fmt::v9::basic_string_view<char>, fmt::v9::basic_format_args<fmt::v9::basic_format_context<fmt::v9::appender, char> >) at ??:? std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > fmt::v9::format<utils::tagged_uuid<raft::server_id_tag>&>(fmt::v9::basic_format_string<char, fmt::v9::type_identity<utils::tagged_uuid<raft::server_id_tag>&>::type>, utils::tagged_uuid<raft::server_id_tag>&) at /usr/include/fmt/core.h:3206 (inlined by) service::storage_service::raft_removenode(utils::tagged_uuid<locator::host_id_tag>) at ./service/storage_service.cc:3572 ``` Maybe it's a bug in `fmt` library? In any case replacing the call with `::format` (i.e. `seastar::format` from seastar/core/print.hh) helps. Do it for the entire file for consistency (and avoiding this bug). Also, for the future, replace `format` calls with `::format` - now it's the same thing, but the latter won't clash with `std::format` once we switch to libstdc++13. Fixes #13707 Closes #13711	2023-05-05 19:23:22 +02:00
Kamil Braun	70f2b09397	Merge 'scylla_cluster.py: fix read_last_line' from Gusev Petr This is a follow-up to #13399, the patch addresses the issues mentioned there: * linesep can be split between blocks; * linesep can be part of UTF-8 sequence; * avoid excessively long lines, limit to 256 chars; * the logic of the function made simpler and more maintainable. Closes #13427 * github.com:scylladb/scylladb: pylib_test: add tests for read_last_line pytest: add pylib_test directory scylla_cluster.py: fix read_last_line scylla_cluster.py: move read_last_line to util.py	2023-05-05 13:29:15 +02:00
Botond Dénes	1e9dcaff01	Merge 'build: cmake: use Seastar API level 6' from Kefu Chai to avoid the FTBFS after we bump up the Seastar submodule which bumped up its API level to v7. and API v7 is a breaking change. so, in order to unbreak the build, we have to hardwire the API level to 6. `configure.py` also does this. Closes #13780 * github.com:scylladb/scylladb: build: cmake: disable deprecated warning build: cmake: use Seastar API level 6	2023-05-05 13:55:34 +03:00
Kefu Chai	05a172c7e7	build: cmake: link against Boost::unit_test_framework we introduced the linkage to Boost::unit_test_framework in `fe70333c19`, this library is used by test/lib/test_utils.cc, so update CMake accordingly. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13781	2023-05-05 13:55:00 +03:00
Petr Gusev	8a0bcf9d9d	pylib_test: add tests for read_last_line	2023-05-05 12:57:43 +04:00
Petr Gusev	7476e91d67	pytest: add pylib_test directory We want to add tests for read_last_line, in this commit we add a new directory for them since there were no tests for pylib code before.	2023-05-05 12:57:43 +04:00
Petr Gusev	330d1d5163	scylla_cluster.py: fix read_last_line This is a follow-up to #13399, the patch addresses the issues mentioned there: * linesep can be split between blocks; * linesep can be part of UTF-8 sequence; * avoid excessively long lines, limit to 512 chars; * the logic of the function made simpler and more maintainable.	2023-05-05 12:57:36 +04:00
Petr Gusev	8a5e211c30	scylla_cluster.py: move read_last_line to util.py We want to add tests for read_last_line, so we move it to make this simper.	2023-05-05 12:51:25 +04:00
Botond Dénes	687a8bb2f0	Merge 'Sanitize test::filename(sstable) API' from Pavel Emelyanov There are two of them currently with slightly different declaration. Better to leave only one. Closes #13772 * github.com:scylladb/scylladb: test: Deduplicate test::filename() static overload test: Make test::filename return fs::path	2023-05-05 11:36:08 +03:00

1 2 3 4 5 ...

36680 Commits