scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Author	SHA1	Message	Date
Benny Halevy	a290505239	utils: stall_free: add dispose_gently dispose_gently consumes the object moved to it, clearing it gently before it's destroyed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#26356	2025-11-11 12:20:18 +02:00
Calle Wilund	565c701226	utils::gcp::object_storage: Fix typo in semaphore init Fixes #26776 Semaphore storage is ssize_t, not size_t.	2025-11-05 10:22:22 +00:00
Pavel Emelyanov	ae0136792b	utils: Make directory_lister use generator lister from seastar The directory_lister uses utils::lister under the hood which accepts a callback to put directory_entry-s in. The directory_lister's callback then puts the entries into a queue and its .get() method pops up entries from there to return to caller. This patch simplifies this code by switching the directory_lister to use experimental generator lister from seastar. With it, the entries to be returned from .get() are simply co_await-ed from calling the generator object (wich co_yield-s them). As a result the directory_lister becomes smaller and drops the need for utils::lister. Since directory_lister was created as a replacement for that callback-based lister, the latter can be eventually removed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#26586	2025-10-28 15:20:20 +02:00
Pavel Emelyanov	d9bfbeda9a	lister: Fix race between readdir and stat Sometimes file::list_directory() returns entries without type set. In thase case lister calls file_type() on the entry name to get it. In case the call returns disengated type, the code assumes that some error occurred and resolves into exception. That's not correct. The file_type() method returns disengated type only if the file being inspected is missing (i.e. on ENOENT errno). But this can validly happen if a file is removed bettween readdir and stat. In that case it's not "some error happened", but a enry should be just skipped. In "some error happened", then file_type() would resolve into exceptional future on its own. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#26595	2025-10-28 15:10:22 +02:00
Lakshmi Narayanan Sreethar	64c1ec99e0	cmake: link crypto lib to utils The utils library requires OpenSSL's libcrypto for cryptographic operations and without linking libcrypto, builds fail with undefined symbol errors. Fix that by linking `crypto` to `utils` library when compiled with cmake. The build files generated with configure.py already have `crypto` lib linked, so they do not have this issue. Fix #26705 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com> Closes scylladb/scylladb#26707	2025-10-28 14:11:03 +02:00
Pavel Emelyanov	54a117b19d	Merge 'retry_strategy: Switch to using seastar's retry_strategy (take two)' from Ernest Zaslavsky With the recent introduction of retry_strategy to Seastar, the pure virtual class previously defined in ScyllaDB is now redundant. This change allows us to streamline our codebase by directly inheriting from Seastar’s implementation, eliminating duplication in ScyllaDB. Despite this update is purely a refactoring effort and does not introduce functional changes it should be ported back to 2025.3 and 2025.4 otherwise it will make future backports of bugfixes/improvements related to `s3_client` near to impossible ref: https://github.com/scylladb/seastar/issues/2803 depends on: https://github.com/scylladb/seastar/pull/2960 Closes scylladb/scylladb#25801 * github.com:scylladb/scylladb: s3_client: remove unnecessary `co_await` in `make_request` s3 cleanup: remove obsolete retry-related classes s3_client: remove unused `filler_exception` s3_client: fix indentation s3_client: simplify chunked download error handling using `make_request` s3_client: reformat `make_request` functions for readability s3_client: eliminate duplication in `make_request` by using overload s3_client: reformat `make_request` function declarations for readability s3_client: reorder `make_request` and helper declarations s3_client: add `make_request` override with custom retry and error handler s3_client: migrate s3_client to Seastar HTTP client s3_client: fix crash in `copy_s3_object` due to dangling stream s3_client: coroutinize `copy_s3_object` response callback aws_error: handle missing `unexpected_status_error` case s3_creds: use Seastar HTTP client with retry strategy retry_strategy: add exponential backoff to `default_aws_retry_strategy` retry_strategy: introduce Seastar-based retry strategy retry_strategy: update CMake and configure.py for new strategy retry_strategy: rename `default_retry_strategy` to `default_aws_retry_strategy` retry_strategy: fix include retry_strategy: Copied utils/s3/retry_strategy.hh to utils/s3/default_aws_retry_strategy.hh retry_strategy: Copied utils/s3/retry_strategy.cc to utils/s3/default_aws_retry_strategy.cc	2025-10-28 13:08:42 +03:00
Pavel Emelyanov	81f598225e	error_injection: Add template parameter default for in release mode The std::optional<T> inject_parameter(...) method is a template, and in dev/debug modes this parameter is defaulted to std::string_view, but for release mode it's not. This patch makes it symmetrical. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#26706	2025-10-27 16:39:22 +01:00
Ernest Zaslavsky	e8ce49dadf	s3_client: remove unnecessary `co_await` in `make_request` Eliminates a redundant `co_await` by directly returning the `future`, simplifying the control flow without affecting behavior.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	71ea973ae4	s3 cleanup: remove obsolete retry-related classes Delete `default_retry_strategy` and `retryable_http_client`, no longer used in `s3_client` after recent refactors.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	d44bbb1b10	s3_client: remove unused `filler_exception` Eliminate the now-obsolete `filler_exception`, rendered redundant by earlier refactors that streamlined error handling in the S3 client.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	d3c6338de6	s3_client: fix indentation Fix indentation in background download fiber in `chunked_download_source`	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	47704deb1e	s3_client: simplify chunked download error handling using `make_request` Refactor `chunked_download_source` to eliminate redundant exception handling by leveraging the new `make_request` override with custom retry strategy. This streamlines the download fiber logic, improving readability and maintainability.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	2bc9b205b6	s3_client: reformat `make_request` functions for readability Reformats `make_request` functions with long argument lists to improve readability and comply with formatting guidelines.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	bf39412f4a	s3_client: eliminate duplication in `make_request` by using overload Removes redundant code in the `make_request` function by invoking the appropriate overload, simplifying logic and improving maintainability.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	695e70834e	s3_client: reformat `make_request` function declarations for readability Reformats the `make_request` function declarations to improve readability due to the large number of arguments. This aligns with our formatting guidelines and makes the code easier to maintain.	2025-10-23 15:58:11 +03:00
Ernest Zaslavsky	9f01c1f3ff	s3_client: reorder `make_request` and helper declarations Performs minor reordering of helper functor declarations in the header file to improve readability and maintain logical grouping.	2025-10-23 15:58:10 +03:00
Ernest Zaslavsky	3d51124cb0	s3_client: add `make_request` override with custom retry and error handler Introduce an override for `make_request` in `s3_client` to support custom retry strategies and error handlers, enabling flexibility beyond the default client behavior and improving control over request handling	2025-10-23 15:58:10 +03:00
Ernest Zaslavsky	bdb3979456	s3_client: migrate s3_client to Seastar HTTP client Eliminate use of `retryable_http_client` in `s3_client` and adopt Seastar's native HTTP client.	2025-10-23 15:58:10 +03:00
Ernest Zaslavsky	2025760e75	s3_client: fix crash in `copy_s3_object` due to dangling stream In the `copy_part` method, move the `input_stream<char>` argument into a local variable before use. Failing to do so can lead to a SIGSEGV or trigger an abort under address sanitizer.	2025-10-23 15:58:10 +03:00
Ernest Zaslavsky	0983c791e9	s3_client: coroutinize `copy_s3_object` response callback coroutinize `copy_s3_object` response callback for a bugfix in the following commit to prevent failing on dangling stream	2025-10-23 15:58:10 +03:00
Ernest Zaslavsky	237217c798	aws_error: handle missing `unexpected_status_error` case Add a missing `case` clause to the `switch` statement to correctly handle scenarios where `unexpected_status_error` is thrown. This fixes overlooked error handling and improves robustness.	2025-10-23 15:58:10 +03:00
Ernest Zaslavsky	4f6384b1a0	s3_creds: use Seastar HTTP client with retry strategy In AWS credentials providers, replace `retryable_http_client` with Seastar's native HTTP client. Integrate the newly added `default_aws_retry_strategy` to handle retries more efficiently and reduce dependency on external retry logic.	2025-10-23 15:58:07 +03:00
Ernest Zaslavsky	3851ee58d7	retry_strategy: add exponential backoff to `default_aws_retry_strategy` Add exponential backoff to `default_aws_retry_strategy` and call it to `sleep` before returning `true`, no-op in case of non-retryable error	2025-10-23 15:49:34 +03:00
Ernest Zaslavsky	524737a579	retry_strategy: introduce Seastar-based retry strategy Add a new class derived from Seastar's `default_retry_strategy`. Relocate the `should_retry` implementation from Scylla's `default_retry_strategy` into the new class to centralize and standardize retry behavior.	2025-10-23 15:49:34 +03:00
Ernest Zaslavsky	51aadd0ab3	retry_strategy: update CMake and configure.py for new strategy Include `default_aws_retry_strategy` in the build system by updating CMake and `configure.py` to ensure it is properly compiled and linked.	2025-10-23 15:49:34 +03:00
Ernest Zaslavsky	5d65b47a15	retry_strategy: rename `default_retry_strategy` to `default_aws_retry_strategy` Renames the `default_retry_strategy` class to `default_aws_retry_strategy` to clarify its association with the S3 client implementation. This avoids confusion with the unrelated `seastar::default_retry_strategy` class.	2025-10-23 15:49:34 +03:00
Ernest Zaslavsky	cc200ced67	retry_strategy: fix include Fix header inclusion in "newly" created file	2025-10-23 15:49:34 +03:00
Ernest Zaslavsky	d679fd514c	retry_strategy: Copied utils/s3/retry_strategy.hh to utils/s3/default_aws_retry_strategy.hh	2025-10-23 15:49:34 +03:00
Ernest Zaslavsky	7cd4be4c49	retry_strategy: Copied utils/s3/retry_strategy.cc to utils/s3/default_aws_retry_strategy.cc	2025-10-23 15:49:34 +03:00
Avi Kivity	ab488fbb3f	Merge 'Switch to seastar API level 9 (no more packet-s in output_stream/data_sink API)' from Pavel Emelyanov Other than patching Scylla sinks to implement new data_sink_impl::put(std::span<temporary_buffer>) overload, the PR changes transport write_response() method to stop using output_stream::write(scattered_message) because it's also gone. Using newer seastar API, no need to backport Closes scylladb/scylladb#26592 * github.com:scylladb/scylladb: code: Fix indentation after previous patch code: Switch to seastar API level 9 transport: Open-code invoke_with_counting into counting_data_sink::put transport: Don't use scattered_message utils: Implement memory_data_sink::put(net::packet)	2025-10-22 01:51:43 +03:00
Botond Dénes	fbceb8c16b	Merge 's3_client: handle failures which require http::request updating' from Ernest Zaslavsky Apply two main changes to the s3_client error handling 1. Add a loop to s3_client's `make_request` for the case whe the retry strategy will not help since the request itself have to be updated. For example, authentication token expiration or timestamp on the request header 2. Refine the way we handle exceptions in the `chunked_download_source` background fiber, now we carry the original `exception_ptr` and also we wrap EVERY exception in `filler_exception` to prevent retry strategy trying to retry the request altogether Fixes: https://github.com/scylladb/scylladb/issues/26483 Should be ported back to 2025.3 and 2025.4 to prevent deadlocks and failures in these versions Closes scylladb/scylladb#26527 * github.com:scylladb/scylladb: s3_client: tune logging level s3_client: add logging s3_client: improve exception handling for chunked downloads s3_client: fix indentation s3_client: add max for client level retries s3_client: remove `s3_retry_strategy` s3_client: support high-level request retries s3_client: just reformat `make_request` s3_client: unify `make_request` implementation	2025-10-21 10:40:38 +03:00
Ernest Zaslavsky	fdd0d66f6e	s3_client: tune logging level Change all logging related to errors in `chunked_download_source` background download fiber to `info` to make it visible right away in logs.	2025-10-20 17:12:59 +03:00
Ernest Zaslavsky	4497325cd6	s3_client: add logging Add logging for the case when we encounter expired credentials, shouldnt happen but just in case	2025-10-20 17:12:59 +03:00
Ernest Zaslavsky	1d34657b14	s3_client: improve exception handling for chunked downloads Refactor the wrapping exception used in `chunked_download_source` to prevent the retry strategy from reattempting failed requests. The new implementation preserves the original `exception_ptr`, making the root cause clearer and easier to diagnose.	2025-10-20 17:12:59 +03:00
Ernest Zaslavsky	58a1cff3db	s3_client: fix indentation Reformat `client::make_request` to fix the indentation of `if` block	2025-10-20 17:12:59 +03:00
Ernest Zaslavsky	43acc0d9b9	s3_client: add max for client level retries To prevent client retrying indefinitely time skew and authentication errors add `max_attempts` to the `client::make_request`	2025-10-20 17:12:59 +03:00
Ernest Zaslavsky	116823a6bc	s3_client: remove `s3_retry_strategy` It never worked as intended, so the credentials handling is moving to the same place where we handle time skew, since we have to reauthenticate the request	2025-10-20 17:12:59 +03:00
Ernest Zaslavsky	185d5cd0c6	s3_client: support high-level request retries Add an option to retry S3 requests at the highest level, including reinitializing headers and reauthenticating. This addresses cases where retrying the same request fails, such as when the S3 server rejects a timestamp older than 15 minutes.	2025-10-20 17:12:59 +03:00
Pavel Emelyanov	44ed3bbb7c	Merge 'RFC: Initial GCP storage backend for scylla (sstables + backup)' from Calle Wilund Integrates GCP object storage as a working storage backend for scylla sstables as well as backup storage. Adds an abstraction layer (atm very heavily designed around the s3 client interface and usage) to allow the "storage" etc layers of sstable management to pick transparently between "s3" and "gs" providers. This modifies the scylla config such that endpoints can optionally (through a "type" param) ref a GS backend. Similarly with storage_options. Also adds some IO wrapping primitives to make it more feasible to place some logic at a mid level of the implementation stack (such as making networked storage files, ranged reading etc). Test s3 fixture is replaced (where appropriate) with an `object_storage` fixture that multiplexes the test across both backends. Unit tests are duplicated and for the GS versions use a boost test fixture for GCS, default local fake. Fixes #25359 Fixes #26453 Closes scylladb/scylladb#26186 * github.com:scylladb/scylladb: docs::dev::object_storage: Add some initial info on GS storage docs/dev: Add mention of (nested) docker usage in testing.md sstables::object_storage_client: Forward memory limit semaphore to GS instance utils::gcp::object_storage: Add optional memory limits to up/download sstables::object_storage_client: Add multi-upload support for GS utils::gcp::storage: Add merge objects operation test_backup/test_basic: Make tests multiplex both s3 and gs backends test::cluster::conftest: Add support for multiple object storage backends boost::gcs_storage_test: reindent boost::gcs_storage_test: Convert to use fixture tests::boost: Add GS object storage cases to mirror S3 ones tests::lib::gcs_fixture: Add a reusable test fixture for real/fake GS/GCS tests::lib::test_utils: Add overloads/helpers for reading and (temp) writing env sstables::object_storage_client: Add google storage implementation test_services: Allow testing with GS object storage parameters utils::gcp::gcp_credentials: Add option to create uninitialized credentials utils::gcp::object_storage: Make create_download_source return seekable_data_source utils::gcp::object_storage: Add defensive copies of string_view params utils::gcp::object_storage: Add missing retry backoff increate utils::gcp::object_storage: Add timestamp to object listing utils::gcp::object_storage: Add paging support to list_objects object_storage_client: Add object_name wrapper type utils::gcp::object_storage: Add optional abort_source utils::rest::client: Add abort_source support sstables: Use object_storage_client for remote storage sstables::object_storage_client: Add abstraction layer for OS cliens (s3 initial) s3::upload_progress: Promote to general util type storage_options: Abstract s3 to "object_storage" and add gs as option sstables::file_io_extension: Change "creator" callback to just data_source utils::io-wrappers: Add ranged data_source utils::io-wrappers: Add file wrapper type for seekable_source utils::seekable_source: Add a seekable IO source type object_storage_endpoint_param: Add gs storage as option config: break out object_storage_endpoint_param preparing for multi storage	2025-10-20 13:14:53 +03:00
Ernest Zaslavsky	db1ca8d011	s3_client: just reformat `make_request` Just reformat previously changed methods to improve readability	2025-10-20 10:44:37 +03:00
Pavel Emelyanov	7d0722ba5c	code: Fix indentation after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-10-17 10:26:50 +03:00
Pavel Emelyanov	a88a36f5b5	code: Switch to seastar API level 9 In the new API the biggest change is to implement the only data_sink_impl::put(span<temporary_buffer>) overload. Encrypted file impl and sstables compress sink use fallback_put() helper that generates a chain of continuations each holding a buffer. The counting_data_sink in transport had mostly been patched to correct implementation by the previous patch, the change here is to replace vector argument with span one. Most other sinks just re-implement their put(vector<temporary_buffer>) overload by iterating over span and non-preemptively grabbing buffers from it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-10-17 10:26:50 +03:00
Pavel Emelyanov	d9808fafdb	utils: Implement memory_data_sink::put(net::packet) It's going to be removed by next-after-next patch, but the next one needs this overload implemented properly, so here it is. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-10-17 10:17:08 +03:00
Ernest Zaslavsky	55fb2223b6	s3_client: unify `make_request` implementation Refactor `make_request` to use a single core implementation that handles authentication and issues the HTTP request. All overloads now delegate to this unified method.	2025-10-16 15:51:28 +03:00
Ernest Zaslavsky	413739824f	s3_client: track memory starvation in background filling fiber Introduce a counter metric to monitor instances where the background filling fiber is blocked due to insufficient memory in the S3 client. Closes scylladb/scylladb#26466	2025-10-14 11:22:54 +03:00
Łukasz Paszkowski	125bf391a7	utils/directories: ignore files when retrieving stats fails During Scylla startup, directories are created and verified in `directories::do_verify_owner_and_mode()`. It is possible that while retrieving file stats, a file might be removed, leading to Scylla failing to boot. This is particularly visible in `storage/test_out_of_space.py` tests, which use FUSE to mount size-limited volumes. When a file that is open by another process is removed, FUSE renames it to `.fuse_hidden*`. In `directories::do_verify_owner_and_mode()`, the code performs a `scan_dir` to list files and retrieves their stats to verify type, mode, and ownership. If a file is removed while retrieving its stats, we see errors such as: ``` Failed to get /scylladir/testlog/x86_64/dev/volumes/e0125c60-1e63-4330-bf6f-c0ea3e466919/scylla-0/hints/1/.fuse_hidden0000001800000005 ``` This change makes `do_verify_owner_and_mode()` ignore files when retrieving stats fails, avoiding spurious errors during verification. Refs: https://github.com/scylladb/scylladb/issues/26314 Closes scylladb/scylladb#26535	2025-10-13 20:41:25 +03:00
Calle Wilund	01f4dfed84	utils::gcp::object_storage: Add optional memory limits to up/download Adds optional memory semaphore to limit the mem buffer usage in sink/source. Note that we don't bookkeep exact, to avoid deadlock issues in higher layer. In upload, we overlease on first buffer put to ensure we can at least fill the desired 8M of buffers. We try to adjust when going over, but if we fail, we fail, but at least will initiate upload -> soon release memory. On next put, we try to grab multiples of 8M again, and so forth. Thus potentially causing waiting for resources, without ending up not uploading at least one active sink. For download (source), we try to get lease for as much as we want to read, but if we fail, we adjust this down to 256k and download anyway. Since this will typically be released immediately, we at least don't overrun for long, and again, avoid fully stopping, throttling rate instead.	2025-10-13 08:53:27 +00:00
Calle Wilund	bd1304972c	utils::gcp::storage: Add merge objects operation Allows merging 1-32 smaller files into a destination.	2025-10-13 08:53:27 +00:00
Calle Wilund	da7099a56e	utils::gcp::gcp_credentials: Add option to create uninitialized credentials To avoid having to async wait for creating credentials, allow lazy init (in actual token renew) of credentials. This is not super pleasant, since it means any error will be late, but it is required more or less for the code paths into which we intend to place this.	2025-10-13 08:53:26 +00:00
Calle Wilund	fd13ffd95d	utils::gcp::object_storage: Make create_download_source return seekable_data_source Since, given the nature of object storage API:s, it is no more complicated to provide a reasonable implementation of a seekable, limited, interface, give this back, which in turn means upper layers can provide easy read-only file interfaces. Hint hint.	2025-10-13 08:53:26 +00:00

1 2 3 4 5 ...

2121 Commits