scylladb

Author	SHA1	Message	Date
Pavel Emelyanov	95809a3ed1	Update seastar submodule * seastar 5b95d1d7...412d058c (62): > fstream: Export functions for making file_data_source > build: Include DPDK dependency libraries in Seastar linkage > demos/tls_echo_server_demo: Modernize with seastar::async > http/client: Pass abort source by pointer > rpc: remove deprecated logging function support > github: Add Alpine Linux workflow to test builds with musl libc > exception_hacks: Make dl_iterate_phdr resolution manual > tests: relax test_file_system_space check for empty filesystems > demos/udp_server_demo: Modernize with seastar::async and proper teardown > future: remove deprecated functions/concepts > util: logger: remove deprecated set_stdout_enabled and logger_ostream_type::{stdout,stderr} > memory: guard __GLIBC_PREREQ usage with __GLIBC__ check > scheduling_specific: Add noexcept wrapper for free() > file: Replace __gid_t with standard POSIX gid_t > aio_storage_context: Use reactor::do_at_exit() > json2code: support chunked_fifo > json: remove unused headers > httpd: test cases for streaming > build: use find_dependency() instead find_package() in config file > build: stop using a loop for finding dependencies > dns: Fix event processing to work safely with recent c-ares > tutorial: add a section about initialization and cleanup > reactor: deprecate at_exit() > httpclient: Add exception handling to connection::close > file: document max_length-limits for dma_read/write funcs taking vector<iovec> > build: fix P2582R1 detection in GCC compatibility check > json2code: optimize string handling using std::string_view > tests/unit: fix typo in test output > doc: Update documentation after removing build.sh > test: Add direct exception passing for awaits for perf test > github: add Docker build verification workflow > docker: update LLVM debian repo for Ubuntu Orcular migration > tests/unit: Use http.HTTPStatus constants instead of raw status codes > tests/unit: Fix exception verification in 
json2code_test.py > httpd: handle streaming results in more handlers > json: stream_object now moves value > json: support for rvalue ranges > chunked_fifo: make copyable > reactor: deprecate at_destroy() > testing: prevent test scheduling after reactor exit > net: Add bytes sent/received metrics > net: switch rss_key_type to std::span instead of std::string_view > log: fixes for libc++ 19 > sstring: fixes for lib++ 19 > build: finalize numactl dependency removal > build: link DPDK against libnuma when detected during build > memory: remove libnuma dependency > treewide: replace assert with SEASTAR_ASSERT > future: fix typo in comment > http: Unwrap nested exceptions to handle retryable transport errors > net/ip, net: sed -i 's/to_ulong/to_uint/' > core: function_traits noexcept specializations > util/variant: seastar::visit forward value arg > net/tls: fix missing include > tls: Add a way to inspect peer certificate chain > websocket: Extract encode_base64() function > websocket: Rename wlogger to websocket_logger > websocket: Extract parts of server_connection usable for client > websocket: Rename connection to server_connection > websocket: Extract websocket parser to separate file > json2code_test: factor out query method > seastar-json2code: fix error handling Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#23281	2025-03-16 21:57:43 +02:00
Botond Dénes	83ea1877ab	Merge 'scylla-sstable: add native S3 support' from Ernest Zaslavsky scylla-sstable: Enable support for S3-stored sstables Minimal implementation of what was mentioned in this [issue](https://github.com/scylladb/scylladb/issues/20532) This update allows Scylla to work with sstables stored on AWS S3. Users can specify the fully qualified location of the sstable using the format: `s3://bucket/prefix/sstable_name`. One should have `object_storage_config_file` referenced in the `scylla.yaml` as described in docs/operating-scylla/admin.rst ref: https://github.com/scylladb/scylladb/issues/20532 fixes: https://github.com/scylladb/scylladb/issues/20535 No backport needed since the S3 functionality was never released Closes scylladb/scylladb#22321 * github.com:scylladb/scylladb: tests: Add Tests for Scylla-SSTable S3 Functionality docs: Update Scylla Tools Documentation for S3 SSTable Support scylla-sstable: Enable Support for S3 SSTables s3: Implement S3 Fully Qualified Name Manipulation Functions object_storage: Refactor `object_storage.yaml` parsing logic	2025-03-14 15:05:52 +02:00
Pavel Emelyanov	2bb455ec75	Merge 'Main: stop system_keyspace' from Benny Halevy This series adds an async guard to system_keyspace operations and adds a deferred action to stop the system_keyspace in main() before destroying the service. This helps to make sure that sys_ks is unplugged from its users and that all async operations using it are drained once it's stopped. * Enhancement, no backport needed Closes scylladb/scylladb#23113 * github.com:scylladb/scylladb: main: stop system keyspace system_keyspace: call shutdown from stop system_keyspace: shutdown: allow calling more than once database, compaction_manager, large_data_handler: use pluggable<system_keysapce> utils: add class pluggable	2025-03-14 13:23:28 +03:00
Nadav Har'El	3ca2e6ddda	Merge 's3_client: Add retries to Security Token Service/EC2 instance metadata credentials providers' from Ernest Zaslavsky Several updates and improvements to the retryable HTTP client functionality, as well as enhancements to error handling and integration with AWS services, as part of this PR. Below is a summary of the changes: - Moved the retryable HTTP client functionality out of the S3 client to improve modularity and reusability across other services like AWS STS. - Isolated the retryable_http_client into its own file, improving clarity and maintainability. - Added a make_request method that introduces a response-skipping handler. - Introduced a custom error handler constructor, providing greater flexibility in handling errors. - Updated the STS and Instance Metadata Service credentials providers to utilize the new retryable HTTP client, enhancing their robustness and reliability. - Extended the AWS error list to handle errors specific to the STS service, ensuring more granular and accurate error management for STS operations. - Enhanced error handling for system errors returned by Seastar’s HTTP client, ensuring smoother operations. - Properly closed the HTTP client in instance_profile_credentials_provider and sts_assume_role_credentials_provider to prevent resource leaks. - Reduced the log severity in the retry strategy to avoid SCT test failures that occur when any log message is tagged as an ERROR. 
No backport needed since no S3-related functionality has been released on the Scylla side yet Closes scylladb/scylladb#21933 * github.com:scylladb/scylladb: s3_client: Adjust Log Severity in Retry Strategy aws_error: Enhance error handling for AWS HTTP client aws_error: Add STS specific error handling credentials_providers: Close retryable clients in Credentials Providers credentials_providers: Integrate retryable_http_client with Credentials Providers s3_client: enhance `retryable_http_client` functionality s3_client: isolate `retryable_http_client` s3_client: Prepare for `retryable_http_client` relocation s3_client: Remove `is_redirect_status` function s3_client: Move retryable functionality out of s3 client	2025-03-12 10:19:15 +02:00
Avi Kivity	b1d9f80d85	Merge 'tablets: Make load balancing capacity-aware' from Tomasz Grabiec Before this patch, the load balancer was equalizing tablet count per shard, so it achieved balance assuming that: 1) tablets have the same size 2) shards have the same capacity That can cause imbalance of utilization if shards have different capacity, which can happen in heterogeneous clusters with different instance types. One of the causes for capacity difference is that larger instances run with fewer shards due to vCPUs being dedicated to IRQ handling. This makes those shards have more disk capacity, and more CPU power. After this patch, the load balancer equalizes shards' storage utilization, so it no longer assumes that shards have the same capacity. It still assumes that each tablet has equal size. So it's a middle step towards full size-aware balancing. One consequence is that to be able to balance, the load balancer needs to know about every node's capacity, which is collected with the same RPC which collects load_stats for average tablet size. This is not a significant setback because migrations cannot proceed anyway if nodes are down due to barriers. We could make intra-node migration scheduling work without capacity information, but it's pointless due to the above, so it is not implemented. Also, per-shard goal for tablet count is still the same for all nodes in the cluster, so nodes with less capacity will be below limit and nodes with more capacity will be slightly above limit. This shouldn't be a significant problem in practice; we could compensate for this by increasing the limit. 
Refs #23042 Closes scylladb/scylladb#23079 * github.com:scylladb/scylladb: tablets: Make load balancing capacity-aware topology_coordinator: Fix confusing log message topology_coordinator: Refresh load stats after adding a new node topology_coordinator: Allow capacity stats to be refreshed with some nodes down topology_coordinator: Refactor load status refreshing so that it can be triggered from multiple places test: boost: tablets_test: Always provide capacity in load_stats test: perf_load_balancing: Set node capacity test: perf_load_balancing: Convert to topology_builder config, disk_space_monitor: Allow overriding capacity via config storage_service, tablets: Collect per-node capacity in load_stats	2025-03-11 14:34:27 +02:00
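The difference between equalizing tablet count and equalizing storage utilization can be illustrated with a small sketch. It assumes, as the message above does, that all tablets have the same size. The names `shard_load_sketch`, `utilization` and `least_utilized` are hypothetical and are not taken from the load balancer's code.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical, simplified model of a shard; not the balancer's real data structures.
struct shard_load_sketch {
    std::uint64_t capacity_bytes; // storage capacity attributed to this shard
    std::uint64_t tablet_count;   // tablets currently placed on this shard
};

// Storage utilization, assuming every tablet has the same size.
inline double utilization(const shard_load_sketch& s, std::uint64_t avg_tablet_bytes) {
    return static_cast<double>(s.tablet_count * avg_tablet_bytes) /
           static_cast<double>(s.capacity_bytes);
}

// Index of the least utilized shard: a plausible destination for the next migration.
// Precondition: shards is not empty.
inline std::size_t least_utilized(const std::vector<shard_load_sketch>& shards,
                                  std::uint64_t avg_tablet_bytes) {
    std::size_t best = 0;
    for (std::size_t i = 1; i < shards.size(); ++i) {
        if (utilization(shards[i], avg_tablet_bytes) <
            utilization(shards[best], avg_tablet_bytes)) {
            best = i;
        }
    }
    return best;
}
```

With two shards holding 10 tablets each, one with twice the capacity of the other, the tablet counts are balanced but the utilization is not, so a utilization-based balancer would still prefer the larger shard.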
Ernest Zaslavsky	c8de7619e5	s3_client: Adjust Log Severity in Retry Strategy * Reduced log severity in retry_strategy. * Rationale: SCT fails tests when any message is logged as ERROR.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	8e46929474	aws_error: Enhance error handling for AWS HTTP client - Seastar's HTTP client is known to throw exceptions for various reasons, including network errors, TLS errors and other transient issues. - Update error handling to correctly capture and process all exceptions from Seastar's HTTP client. - Previously, only aws_exception was handled, causing retryable errors to be missed and `should_retry` not invoked. - Now, all exceptions trigger the appropriate retry logic per the intended strategy. - Add tests for the S3 proxy to ensure robustness and reliability of these enhancements.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	92a12c96a2	aws_error: Add STS specific error handling Updated the AWS error list to include handling for errors specific to the STS service. This enhancement ensures more comprehensive error management for STS-related operations.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	a371d6cf62	credentials_providers: Close retryable clients in Credentials Providers Updated `instance_profile_credentials_provider` and `sts_assume_role_credentials_provider` to close the HTTP client appropriately.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	45a6e88954	credentials_providers: Integrate retryable_http_client with Credentials Providers * Updated STS and Instance Metadata Service credentials providers to utilize retryable_http_client.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	7c49ee4520	s3_client: enhance `retryable_http_client` functionality Enhanced `retryable_http_client` by allowing the injection of a custom error handler through its constructor.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	b589a882bb	s3_client: isolate `retryable_http_client` Relocated `retryable_http_client` into its own dedicated file for improved clarity and maintainability.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	5eff83af95	s3_client: Prepare for `retryable_http_client` relocation Expose `map_s3_client_exception` outside the S3 client class to facilitate moving `retryable_http_client` to a separate file.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	2b3abba10a	s3_client: Remove `is_redirect_status` function Eliminate the `is_redirect_status` function in favor of the equivalent functionality provided by Seastar's HTTP client.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	5b7d4a4136	s3_client: Move retryable functionality out of s3 client This commit moves the retryable HTTP client functionality out of the S3 client implementation. Since this functionality is also required for other services, such as AWS STS, it has been separated to ensure broader applicability.	2025-03-10 09:01:47 +02:00
Ernest Zaslavsky	88c4fa6569	s3: Implement S3 Fully Qualified Name Manipulation Functions Added utility functions to handle S3 Fully Qualified Names (FQN). These functions enable parsing, splitting, and identification of S3 paths, enhancing our ability to work with S3 object storage more effectively.	2025-03-09 09:50:36 +02:00
Robert Bindar	27f2d64725	Remove object storage config credentials provider During development of #22428 we decided that we have no need for `object-storage.yaml`, and we'd rather store the endpoints in `scylla.yaml` and get a REST API to expose the endpoints for free. This patch removes the credentials provider used to read the AWS keys from this YAML file. Follow-up work will remove the `object-storage.yaml` file altogether and move the endpoints to `scylla.yaml`. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com> Closes scylladb/scylladb#22951	2025-03-07 10:40:58 +03:00
Tomasz Grabiec	d01cc16d1e	config, disk_space_monitor: Allow overriding capacity via config Intended for testing, or hot-fixing out-of-space issues in production. Tablet load balancer uses this information for determining per-shard load so reducing capacity will cause tablets to be migrated away from the node.	2025-03-06 13:35:37 +01:00
Avi Kivity	28906c9261	Merge 'scylla-sstable: introduce the query command' from Botond Dénes The scylla-sstable dump-* command suite has proven invaluable in many investigations. In certain cases however, I found that `dump-data` is quite cumbersome. An example would be trying to find certain values in an sstable, or trying to read the content of system tables when a node is down. For these cases, `dump-data` is very cumbersome: one has to trudge through tons of uninteresting metadata and do compaction in their heads. This PR introduces the new scylla-sstable query command, specifically targeted at situations like this: it allows executing queries on sstables, exposing to the user all the power of CQL, to tailor the output as they see fit. Select everything from a table: $ scylla sstable query --system-schema /path/to/data/system_schema/keyspaces-/-big-Data.db keyspace_name \| durable_writes \| replication -------------------------------+----------------+------------------------------------------------------------------------------------- system_replicated_keys \| true \| ({class : org.apache.cassandra.locator.EverywhereStrategy}) system_auth \| true \| ({class : org.apache.cassandra.locator.SimpleStrategy}, {replication_factor : 1}) system_schema \| true \| ({class : org.apache.cassandra.locator.LocalStrategy}) system_distributed \| true \| ({class : org.apache.cassandra.locator.SimpleStrategy}, {replication_factor : 3}) system \| true \| ({class : org.apache.cassandra.locator.LocalStrategy}) ks \| true \| ({class : org.apache.cassandra.locator.NetworkTopologyStrategy}, {datacenter1 : 1}) system_traces \| true \| ({class : org.apache.cassandra.locator.SimpleStrategy}, {replication_factor : 2}) system_distributed_everywhere \| true \| ({class : org.apache.cassandra.locator.EverywhereStrategy}) Select everything from a single SSTable, use the JSON output (filtered through [jq](https://jqlang.github.io/jq/) for better readability): $ scylla sstable query 
--system-schema --output-format=json /path/to/data/system_schema/keyspaces-/me-3gm7_127s_3ndxs28xt4llzxwqz6-big-Data.db \| jq [ { "keyspace_name": "system_schema", "durable_writes": true, "replication": { "class": "org.apache.cassandra.locator.LocalStrategy" } }, { "keyspace_name": "system", "durable_writes": true, "replication": { "class": "org.apache.cassandra.locator.LocalStrategy" } } ] Select a specific field in a specific partition using the command-line: $ scylla sstable query --system-schema --query "select replication from scylla_sstable.keyspaces where keyspace_name='ks'" ./scylla-workdir/data/system_schema/keyspaces-/-Data.db replication ------------------------------------------------------------------------------------- ({class : org.apache.cassandra.locator.NetworkTopologyStrategy}, {datacenter1 : 1}) Select a specific field in a specific partition using ``--query-file``: $ echo "SELECT replication FROM scylla_sstable.keyspaces WHERE keyspace_name='ks';" > query.cql $ scylla sstable query --system-schema --query-file=./query.cql ./scylla-workdir/data/system_schema/keyspaces-/-Data.db replication ------------------------------------------------------------------------------------- ({class : org.apache.cassandra.locator.NetworkTopologyStrategy}, {datacenter1 : 1}) New functionality: no backport needed. Closes scylladb/scylladb#22007 github.com:scylladb/scylladb: docs/operating-scylla: document scylla-sstable query test/cqlpy/test_tools.py: add tests for scylla-sstable query test/cqlpy/test_tools.py: make scylla_sstable() return table name also scylla-sstable: introduce the query command tools/utils: get_selected_operation(): use std::string for operation_options utils/rjson: streaming_writer: add RawValue() cql3/type_json: add to_json_type() test/lib/cql_test_env: introduce do_with_cql_env_noreentrant_in_thread()	2025-03-06 13:42:45 +02:00
Tomasz Grabiec	7e7f1e6f91	storage_service, tablets: Collect per-node capacity in load_stats A new RPC is introduced because load_stats was marked "final" in the IDL. It will be needed by capacity-aware load balancing.	2025-03-06 12:17:32 +01:00
Pavel Emelyanov	86b3e9b50b	code: Move checked-file-impl.hh to util/ fixes: #22100 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#23123	2025-03-06 10:22:05 +02:00
Benny Halevy	13a22cb6fd	utils: add class pluggable A wrapper around a shared service allowing safe plug and unplug of the service from its user using a phased-barrier operation permit guarding the service while in use. Also add a unit test for this class. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-03-05 08:25:50 +02:00
Amnon Heiman	bf39a760aa	utils/logalloc.cc label metrics with basic_level The following metrics will be marked with basic_level label: scylla_lsa_total_space_bytes scylla_lsa_non_lsa_used_space_bytes Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2025-03-03 16:58:38 +02:00
Amnon Heiman	30b34d29b2	Adding the __level and features labels Scylla generates many metrics, and when multiplied by the number of shards, the total number of metrics adds a significant load to a monitoring server. With multi-tier monitoring, it is helpful to have a smaller subset of metrics users care about and allow them to get only those. This patch adds two kinds of labels. The first is a __level label, currently with a single value, but we can add more in the future. The second kind is a cross-feature label, currently for alternator, cdc and cas. We will use the __level label to mark the interesting user-facing metrics. The current level value is: basic - metrics for Scylla monitoring In this phase, basic will mark all metrics used in the dashboards. In practice, without any configuration change, Prometheus would get the same metrics as it gets today, while it is possible to filter by the label, e.g.: curl http://localhost:9180/metrics?__level=basic The labels themselves are not reported, thanks to the filtering of labels that begin with __. The feature labels: __cdc, __cas and __alternator can be an easy way to disable a set of metrics when not using a feature. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2025-03-03 16:58:38 +02:00
Kefu Chai	c45f9b7155	utils/sorting: Fix VerticesContainer concept constraints Fix a bug where std::same_as<...> constraint was incorrectly used as a simple requirement instead of a nested requirement or part of a conjunction. This caused the constraint to be always satisfied regardless of the actual types involved. This change promotes std::same_as<...> to a top-level constraint, ensuring proper type checking while improving code readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#23068	2025-02-26 23:23:53 +02:00
Pavel Emelyanov	27e96be6ad	B+tree: Clean const_iterator->iterator conversion The tree code has const and non-const overloads for searching methods like find(), lower_bound(), etc. To avoid implementing them twice, it's coded like const_iterator find() const { ... // the implementation itself } iterator find() { return iterator(const_cast<const *>(this)->find()); } i.e. the const overload is called, and the const_iterator it returns is converted into a non-const iterator. For that the latter has a dedicated constructor with two inaccuracies: it's not marked as explicit and it accepts a const rvalue reference. This patch fixes both. Although this disables implicit const -> non-const conversion of iterators, the constructor in question is public, which still opens a way for conversion (without const_cast<>). This constructor would better be marked private, but there's the double_decker class that uses bptree and exploits the same hacks in its finding methods, so it needs this constructor to be callable. Alas. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#23069	2025-02-26 23:17:27 +02:00
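The pattern in question, a non-const find() that reuses the const overload through an explicit conversion constructor, might look roughly like the sketch below. `tiny_set` is a hypothetical stand-in for the B+tree, and taking the const_iterator by value is an assumption, since the message does not show the new constructor signature.

```cpp
#include <cstddef>
#include <type_traits>

// Hypothetical container standing in for the B+tree.
class tiny_set {
    int _data[4] = {1, 3, 5, 7};

public:
    class const_iterator {
        const int* _p;
    public:
        explicit const_iterator(const int* p) : _p(p) {}
        const int* get() const { return _p; }
        const int& operator*() const { return *_p; }
        bool at_end() const { return _p == nullptr; }
    };

    class iterator {
        int* _p;
    public:
        // explicit: const_iterator -> iterator must be spelled out at the call site.
        explicit iterator(const_iterator it) : _p(const_cast<int*>(it.get())) {}
        int& operator*() const { return *_p; }
        bool at_end() const { return _p == nullptr; }
    };

    // The search logic is implemented once, in the const overload.
    const_iterator find(int v) const {
        for (std::size_t i = 0; i < 4; ++i) {
            if (_data[i] == v) {
                return const_iterator(&_data[i]);
            }
        }
        return const_iterator(nullptr);
    }

    // The non-const overload delegates and converts the result explicitly.
    iterator find(int v) {
        return iterator(const_cast<const tiny_set*>(this)->find(v));
    }
};

// The implicit conversion is gone; the explicit construction still works.
static_assert(!std::is_convertible_v<tiny_set::const_iterator, tiny_set::iterator>);
static_assert(std::is_constructible_v<tiny_set::iterator, tiny_set::const_iterator>);
```

Because the constructor stays public, any caller can still write `iterator(some_const_iterator)`, which is the remaining loophole the message points out.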
Kefu Chai	da9960db1c	tree: Fix polymorphic exception handling by using references Replace value-based exception catching with reference-based catching to address GCC warnings about polymorphic type slicing: ``` warning: catching polymorphic type ‘class seastar::rpc::stream_closed’ by value [-Wcatch-value=] ``` When catching polymorphic exceptions by value, the C++ runtime copies the thrown exception into a new instance of the specified type, slicing the actual exception and potentially losing important information. This change ensures all polymorphic exceptions are caught by reference to preserve the complete exception state. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#23064	2025-02-26 23:15:16 +02:00
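The slicing effect can be seen in a small standalone sketch. The exception types `base_error` and `stream_closed_error` are hypothetical and merely stand in for a polymorphic hierarchy such as the Seastar one named in the warning. The by-value handler is kept on purpose and is expected to trigger `-Wcatch-value`.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical polymorphic exception hierarchy.
struct base_error : std::runtime_error {
    explicit base_error(const std::string& msg) : std::runtime_error(msg) {}
    virtual std::string kind() const { return "base"; }
};

struct stream_closed_error : base_error {
    explicit stream_closed_error(const std::string& msg) : base_error(msg) {}
    std::string kind() const override { return "stream_closed"; }
};

// Catching by value copies only the base_error part of the thrown object.
inline std::string kind_caught_by_value() {
    try {
        throw stream_closed_error("closed");
    } catch (base_error e) { // slices: the dynamic type is lost
        return e.kind();
    }
}

// Catching by reference keeps the dynamic type of the thrown object.
inline std::string kind_caught_by_reference() {
    try {
        throw stream_closed_error("closed");
    } catch (const base_error& e) {
        return e.kind();
    }
}
```

Here `kind_caught_by_value()` reports the base type, while `kind_caught_by_reference()` reports the derived one.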
Kefu Chai	9fdbe0e74b	tree: Remove unused boost headers This commit eliminates unused boost header includes from the tree. Removing these unnecessary includes reduces dependencies on the external Boost.Adapters library, leading to faster compile times and a slightly cleaner codebase. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22997	2025-02-25 10:32:32 +03:00
Kefu Chai	d384b0a63e	utils: use std::to_underlying() when appropriate Use std::to_underlying() when comparing unsigned types with enumeration values to fix type mismatch warnings in GCC-14. This specifically addresses an issue in utils/advanced_rpc_compressor.hh where comparing a uint8_t with 0 triggered a '-Werror=type-limits' warning: ``` error: comparison is always false due to limited range of data type [-Werror=type-limits] if (x < 0 \|\| x >= static_cast<underlying>(type::COUNT)) ~~^~~ ``` Using std::to_underlying() provides clearer type semantics and avoids these kind of comparison warnings. This change improves code readability while maintaining the same behavior. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22898	2025-02-19 12:12:28 +03:00
Botond Dénes	a6caade11d	utils/rjson: streaming_writer: add RawValue() Exposes the RawValue() method of the underlying rapidjson::Writer. This method allows writing a pre-formatted json value to the stream. This will allow using cql3/type_json.hh to pre-format CQL3 types, then write these pre-formatted values into a json stream.	2025-02-17 08:01:38 -05:00
Pavel Emelyanov	b52d1a3d99	s3/client: Make http client connections limit configurable It's now calculated based on sched group shares, but for tests explicit value is needed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-02-14 16:27:25 +03:00
Ernest Zaslavsky	5a266926e5	s3_client: Increase default part size for optimal performance Set the `upload_file` part size to 50MiB, as this value provides the best performance based on tests conducted using `perf_s3_client` on an i4i.4xlarge instance. ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 5 INFO 2025-02-06 10:34:08,007 [shard 0:main] perf - Uploaded 1024MB in 27.768863962s, speed 36.87583335786734MB/s ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 10 INFO 2025-02-06 10:35:07,161 [shard 0:main] perf - Uploaded 1024MB in 28.175412552s, speed 36.34374467845414MB/s ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 20 INFO 2025-02-06 10:35:55,530 [shard 0:main] perf - Uploaded 1024MB in 14.483539631s, speed 70.700949221575MB/s ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 30 INFO 2025-02-06 10:36:35,466 [shard 0:main] perf - Uploaded 1024MB in 11.486155799s, speed 89.15080188004683MB/s ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 40 INFO 2025-02-06 10:37:46,642 [shard 0:main] perf - Uploaded 1024MB in 10.236196424s, speed 100.03715809898961MB/s /perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 50 INFO 2025-02-06 10:38:34,777 [shard 0:main] perf - Uploaded 1024MB in 9.490644522s, speed 107.895728011548MB/s ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 60 INFO 2025-02-06 10:39:08,832 [shard 0:main] perf - Uploaded 1024MB in 9.767783693s, speed 104.83442633295012MB/s ./perf_s3_client --smp 1 --upload --object_name ./1G-test-file --sockets 1 --part_size_mb 70 INFO 2025-02-06 10:39:47,916 [shard 0:main] perf - Uploaded 1024MB in 10.166116742s, speed 100.72675988162482MB/s Closes scylladb/scylladb#22732	2025-02-07 13:49:54 +03:00
Ernest Zaslavsky	97d789043a	s3_client: Fix buffer offset reset on request retry This patch addresses an issue where the buffer offset becomes incorrect when a request is retried. The new request uses an offset that has already been advanced, causing misalignment. This fix ensures the buffer offset is correctly reset, preventing such errors. Closes scylladb/scylladb#22729	2025-02-07 08:52:08 +03:00
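The class of bug described above can be modelled without any networking. In the sketch below, every attempt starts reading the payload from offset zero; keeping the offset outside the retry loop would resend from the already advanced position. All names (`upload_result_sketch`, `upload_with_retry`) and the failure simulation are assumptions for illustration and do not mirror the S3 client's code.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

struct upload_result_sketch {
    bool ok;
    std::string received; // what the simulated server got on the successful attempt
    int attempts;
};

// Sends `payload` in chunks. The first `failing_attempts` attempts fail after
// their first chunk to simulate a transient error.
// Precondition: chunk_size > 0.
inline upload_result_sketch upload_with_retry(std::string_view payload,
                                              std::size_t chunk_size,
                                              int max_attempts,
                                              int failing_attempts) {
    for (int attempt = 1; attempt <= max_attempts; ++attempt) {
        std::size_t offset = 0; // reset for every attempt
        std::string received;   // the simulated server also starts from scratch
        bool failed = false;
        while (offset < payload.size()) {
            const auto part = payload.substr(offset, chunk_size);
            received.append(part);
            offset += part.size();
            if (attempt <= failing_attempts) {
                failed = true; // simulated transient error
                break;
            }
        }
        if (!failed) {
            return {true, received, attempt};
        }
    }
    return {false, {}, max_attempts};
}
```

If `offset` were declared before the `for` loop, the second attempt in this model would begin at the already advanced position and the server would receive a truncated body.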
Pavel Emelyanov	64baab1b95	Merge 'config: prevent SIGHUP from changing non-liveupdatable parameters' from Andrzej Jackowski Before this change, it was possible to change a non-liveupdatable config parameter without a process restart. This erroneous behavior not only contradicts the documentation but is potentially dangerous, as various components theoretically might not be prepared for a change of configuration parameter value without a restart. The issue came from the fact that the liveupdatability verification check was skipped for default configuration parameters (those without initial values in the configuration file during process start). This change: - Introduce _initialization_completed member in config_file - Set _initialization_completed=true when config file is processed on server start - Verify config_file's initialization status during config update - if config_file was initialized, prevent further changes of non-liveupdatable parameters - Implement ScyllaRESTAPIClient::get_config() that obtains the current value of a given configuration parameter via /v2/config REST API - Implement test to confirm that only liveupdatable parameters are changed when SIGHUP is sent after configuration file change Function set_initialization_completed() is called only once in main.cc, and the effect is expected to be visible in all shards, as a side effect of cfg->broadcast_to_all_shards() that is called shortly after. The same technique was already used for the enable_3_1_0_compatibility_mode() call. Fixes scylladb/scylladb#5382 No backport - minor fix. Closes scylladb/scylladb#22655 * github.com:scylladb/scylladb: test: SIGHUP doesn't change non-liveupdatable configuration test: implement ScyllaRESTAPIClient::get_config() config: prevent SIGHUP from changing non-liveupdatable parameters config: remove unused set_value_on_all_shards(const YAML::Node&)	2025-02-06 11:33:59 +03:00
Ernest Zaslavsky	dee4fc7150	aws creds: add STS and Instance Metadata service credentials providers This commit introduces two new credentials providers: STS and Instance Metadata Service. The S3 client's provider chain has been updated to incorporate these new providers. Additionally, unit tests have been added to ensure coverage of the new functionality.	2025-02-05 14:57:19 +02:00
Ernest Zaslavsky	d534051bea	aws creds: add env. and file credentials providers This commit entirely removes credentials from the endpoint configuration. It also eliminates all instances of manually retrieving environment credentials. Instead, the construction of file and environment credentials has been moved to their respective providers. Additionally, a new aws_credentials_provider_chain class has been introduced to support chaining of multiple credential providers.	2025-02-05 14:57:19 +02:00
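Chaining credential providers usually means asking each provider in order and taking the first answer. A minimal sketch of that idea follows. The types `credentials_sketch`, `provider_sketch` and `provider_chain_sketch` are hypothetical and are not the interface of aws_credentials_provider_chain.

```cpp
#include <functional>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical credential record.
struct credentials_sketch {
    std::string access_key_id;
    std::string secret_access_key;
};

// A provider either yields credentials or reports that it has none.
using provider_sketch = std::function<std::optional<credentials_sketch>()>;

class provider_chain_sketch {
    std::vector<provider_sketch> _providers;

public:
    // Providers are consulted in the order in which they were added.
    provider_chain_sketch& add(provider_sketch p) {
        _providers.push_back(std::move(p));
        return *this;
    }

    // Returns the credentials of the first provider that has any.
    std::optional<credentials_sketch> get() const {
        for (const auto& p : _providers) {
            if (auto creds = p()) {
                return creds;
            }
        }
        return std::nullopt;
    }
};
```

In this model an environment provider would typically come before a file provider, so that explicitly exported variables take precedence; the actual order used by the S3 client is not stated in the message.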
Andrzej Jackowski	dd899c0f1f	config: prevent SIGHUP from changing non-liveupdatable parameters Before this change, it was possible to change a non-liveupdatable config parameter without a process restart. This erroneous behavior not only contradicts the documentation but is potentially dangerous, as various components theoretically might not be prepared for a change of configuration parameter value without a restart. The issue came from the fact that the liveupdatability verification check was skipped for default configuration parameters (those without initial values in the configuration file during process start). This change: - Introduce _initialization_completed member in config_file - Set _initialization_completed=true when config file is processed on server start - Verify config_file's initialization status during config update - if config_file was initialized, prevent further changes of non-liveupdatable parameters Fixes scylladb/scylladb#5382	2025-02-05 09:37:30 +01:00
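The guard described in the list above can be sketched as follows. Only the `_initialization_completed` member and `set_initialization_completed()` are named in the message; the class `config_sketch` and the rest of its interface are hypothetical simplifications of config_file.

```cpp
#include <map>
#include <string>
#include <utility>

class config_sketch {
    struct entry {
        std::string value;
        bool live_updatable;
    };
    std::map<std::string, entry> _entries;
    bool _initialization_completed = false;

public:
    // Registers a parameter with its default value.
    void add(const std::string& name, std::string default_value, bool live_updatable) {
        _entries[name] = entry{std::move(default_value), live_updatable};
    }

    // Called once, after the configuration file was processed at startup.
    void set_initialization_completed() { _initialization_completed = true; }

    // Returns false when the update is rejected.
    bool set_value(const std::string& name, std::string value) {
        auto it = _entries.find(name);
        if (it == _entries.end()) {
            return false;
        }
        // After startup, only live-updatable parameters may change, whether or
        // not the parameter had an explicit value in the initial file.
        if (_initialization_completed && !it->second.live_updatable) {
            return false;
        }
        it->second.value = std::move(value);
        return true;
    }

    const std::string& get(const std::string& name) const {
        return _entries.at(name).value;
    }
};
```

The key point is that the check depends on the initialization state alone, not on where the current value came from.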
Ernest Zaslavsky	c911fc4f34	s3 creds: move credentials out of endpoint config This commit refactors the way AWS credentials are managed in Scylla. Previously, credentials were included in the endpoint configuration. However, since credentials and endpoint configurations serve different purposes and may have different lifetimes, it’s more logical to manage them separately. Moving forward, credentials will be completely removed from the endpoint_config to ensure clear separation of concerns.	2025-02-04 16:45:23 +02:00
Andrzej Jackowski	fb118bfd3b	config: remove unused set_value_on_all_shards(const YAML::Node&) This change: - Remove unused set_value_on_all_shards(const YAML::Node&) member function in class config_file::named_value The function logic was flawed, in a similar way named_value<T>::set_value(const YAML::Node& node) is flawed: the config source verification is insufficient for liveupdatable parameters, allowing overwriting of non-liveupdatable config parameters (refer to scylladb#5382). As the function was not used, it was removed instead of fixing.	2025-02-04 15:09:23 +01:00
Botond Dénes	9fc14f203b	Merge 'Simplify loading_cache_test and use manual_clock' from Benny Halevy This series exposes a Clock template parameter for loading_cache so that the test could use the manual_clock rather than the lowres_clock, since relying on the latter is flaky. In addition, the test load function is simplified to sleep some small random time and co_return the expected string, rather than reading it from a real file, since the latter's timing might also be flaky, and it is out of scope for this test. Fixes #20322 * The test was flaky forever, so backport is required for all live versions. Closes scylladb/scylladb#22064 * github.com:scylladb/scylladb: tests: loading_cache_test: use manual_clock utils: loading_cache: make clock_type a template parameter test: loading_cache_test: use function-scope loader test: loading_cache_test: simlute loader using sleep test: lib: eventually: add sleep function param test: lib: eventually: make *EVENTUALLY_EQUAL inline functions	2025-01-27 13:13:41 +01:00
Avi Kivity	a23a3110b5	utils: config_file: forward_declare boost::program_options classes Avoid pulling in boost dependencies when all we need is the class name. Closes scylladb/scylladb#22453	2025-01-27 10:45:43 +03:00
Kefu Chai	769162de91	tree: correct misspellings these misspellings were identified by codespell. let's fix them. one of them is a part of a user visible string. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22443	2025-01-26 15:54:06 +02:00
Kefu Chai	d1c222d9bd	config: specialize config_from_string() for sstring Specialize config_from_string() for sstring to resolve lexical_cast stream state parsing limitation. This enables correct handling of empty string configurations, such as setting an empty value in CQL: ```cql UPDATE system.config SET value='' WHERE name='allowed_repair_based_node_ops'; ``` Previous implementation using boost::lexical_cast would fail due to EOF stream state, incorrectly rejecting valid empty string conversions. Fixes scylladb/scylladb#22491 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22492	2025-01-26 15:53:12 +02:00
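The failure mode described in d1c222d9bd can be illustrated without Boost. The sketch below is an assumption-laden stand-in, not Scylla's `config_from_string()`: the name `from_string` is hypothetical, and `std::istringstream` plays the role of the stream-based parsing that the commit message attributes to `boost::lexical_cast`. Extracting from an empty stream hits EOF and fails, so the generic path rejects `''`, while a string specialization can simply return the input.

```cpp
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical generic conversion that parses through a stream,
// standing in for a lexical_cast-style implementation.
template <typename T>
T from_string(const std::string& text) {
    std::istringstream in(text);
    T value{};
    // On empty input the extraction reaches EOF immediately and sets failbit.
    if (!(in >> value)) {
        throw std::invalid_argument("cannot parse: '" + text + "'");
    }
    return value;
}

// Specialization for strings: no parsing is needed, so the input is
// returned unchanged and an empty string is a valid result.
template <>
inline std::string from_string<std::string>(const std::string& text) {
    return text;
}
```

With this shape, `from_string<std::string>("")` yields an empty string, whereas `from_string<int>("")` still throws. A side effect of bypassing the stream is that strings containing whitespace are also returned whole rather than truncated at the first space.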
Benny Halevy	0841483d68	utils: loading_cache: make clock_type a template parameter So the unit test can use manual_clock rather than lowres_clock which can be flaky (in particular in debug mode). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-01-23 09:28:08 +02:00
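The idea behind 0841483d68, making the clock a template parameter so a test can control time, can be sketched in plain standard C++. Everything named here is hypothetical: `fake_clock` is only a rough analogue of a manually advanced clock, and `expiring_value` is a toy class, not `loading_cache`.

```cpp
#include <chrono>
#include <optional>
#include <string>
#include <utility>

// Hypothetical clock whose time moves only when advance() is called.
struct fake_clock {
    using duration = std::chrono::milliseconds;
    using rep = duration::rep;
    using period = duration::period;
    using time_point = std::chrono::time_point<fake_clock, duration>;
    static constexpr bool is_steady = true;

    static time_point now() { return time_point(elapsed()); }
    static void advance(duration d) { elapsed() += d; }

private:
    static duration& elapsed() {
        static duration value{0};
        return value;
    }
};

// Toy cache entry that expires after a TTL measured on Clock.
// Production code would use the default; tests substitute fake_clock.
template <typename Clock = std::chrono::steady_clock>
class expiring_value {
public:
    expiring_value(std::string value, typename Clock::duration ttl)
        : _value(std::move(value)), _deadline(Clock::now() + ttl) {}

    // Returns the value until the deadline is reached, then nothing.
    std::optional<std::string> get() const {
        if (Clock::now() >= _deadline) {
            return std::nullopt;
        }
        return _value;
    }

private:
    std::string _value;
    typename Clock::time_point _deadline;
};
```

A test can then construct `expiring_value<fake_clock>`, call `fake_clock::advance()` by exact amounts, and check expiry deterministically instead of sleeping and hoping a coarse wall clock has ticked.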
Kefu Chai	d5d251da9a	utils: implement drop-in replacement for boost::adaptors::uniqued Add a custom implementation of boost::adaptors::uniqued that is compatible with C++20 ranges library. This bridges the gap between Boost.Range and the C++ standard library ranges until std::views::unique becomes available in C++26. Currently, the unique view is included in [P2214](https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2023/p2760r0.html) "A Plan for C++ Ranges Evolution", which targets C++26. The implementation provides: - A lazy view adaptor that presents unique consecutive elements - No modification of source range - Compatibility with C++20 range views and concepts - Lighter header dependencies compared to Boost This resolves compilation errors when piping C++20 range views to boost::adaptors::uniqued, which fails due to concept requirements mismatch. For example: ```c++ auto range = std::views::take(n) | boost::adaptors::uniqued; // fails ``` This change also offers us a lightweight solution in terms of smaller header dependency. While std::ranges::unique exists in C++20, it's an eager algorithm that modifies the source range in-place, unlike boost::adaptors::uniqued which is a lazy view. The proposed std::views::unique (P2214) targeting C++26 would provide this functionality, but is not yet available. This implementation serves as an interim solution for filtering consecutive duplicate elements using range views until std::views::unique is standardized. For more details on the differences between `std::ranges::unique` and `boost::adaptors::uniqued`: - boost::adaptors::uniqued is a view adaptor that creates a lazy view over the original range. It: * Doesn't modify the source range * Returns a view that presents unique consecutive elements * Is non-destructive and lazy-evaluated * Can be composed with other views - std::ranges::unique is an algorithm that: * Modifies the source range in-place * Removes consecutive duplicates by shifting elements * Returns a subrange that begins at the new logical end * Cannot be used as a view or composed with other range adaptors Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2025-01-21 16:24:45 +08:00
Kefu Chai	4a5a00347f	utils: do not include unused headers these unused includes were identified by clang-include-cleaner. after auditing these source files, all of the reports have been confirmed. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22201	2025-01-17 11:24:54 +03:00
Avi Kivity	d6f7f873d0	utils: config_file: don't use extern fully specialized variable templates Declaring-but-not-defining a fully specialized template is a great way to cut dependencies between users and providers, but unfortunately not supported for variable templates. Clang 18 does support it, but apparently it is a misinterpretation of the standard, and was removed in clang 19. We started using this non-feature in `7ed89266b3`. The fix is to use function templates. This is more verbose as each specialization needs to define a static variable to return, but is fully supported. Closes scylladb/scylladb#22299	2025-01-17 11:06:50 +03:00
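The function-template pattern that d6f7f873d0 switches to might look roughly like the following. The names (`type_label`, `describe_int`) are hypothetical and not taken from `config_file`; the sketch only shows the declare-in-header, define-in-source split, with each specialization returning a function-local static as the commit message describes.

```cpp
#include <string>

// "Header" part: the primary template is declared but never defined,
// and each supported type gets a declared-only full specialization.
template <typename T>
const std::string& type_label();

template <>
const std::string& type_label<int>();
template <>
const std::string& type_label<bool>();

// A user of the header needs only the declarations above.
inline std::string describe_int() {
    return "int is " + type_label<int>();
}

// "Source" part: each specialization defines the static it returns.
// In a real project these definitions would live in a single .cc file.
template <>
const std::string& type_label<int>() {
    static const std::string label = "integer";
    return label;
}

template <>
const std::string& type_label<bool>() {
    static const std::string label = "boolean";
    return label;
}
```

Calling `type_label<T>()` for a type without a specialization fails at link time, which preserves the dependency cut while staying within what the commit describes as fully supported.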
Kefu Chai	7215d4bfe9	utils: do not include unused headers these unused includes were identified by clang-include-cleaner. after auditing these source files, all of the reports have been confirmed. please note, because quite a few source files relied on `utils/to_string.hh` to pull in the specialization of `fmt::formatter<std::optional<T>>`, after removing `#include <fmt/std.h>` from `utils/to_string.hh`, we have to include `fmt/std.h` directly. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2025-01-14 07:56:39 -05:00
Botond Dénes	686a997c04	Merge 'Complete implementation of configuring IO bandwidth limits' from Pavel Emelyanov In Scylla there are two options that control IO bandwidth limit -- the /storage_service/(compaction|stream)_throughput REST API endpoints. The endpoints are partially implemented and have no counterparts in the nodetool. This set implements the missing bits and adds tests for new functionality. Closes scylladb/scylladb#21877 * github.com:scylladb/scylladb: nodetool: Implement [gs]etstreamthroughput commands nodetool: Implement [gs]etcompactionthroughput commands test: Add validation of how IO-updating endpoints work api: Implement /storage_service/(stream|compaction)_throughput endpoints api: Disqualify const config reference api: Implement /storage_service/stream_throughput endpoint api: Move stream throughput set/get endpoints from storage service block api: Move set_compaction_throughput_mb_per_sec to config block util: Include fmt/ranges.h in config_file.hh	2025-01-14 07:56:38 -05:00
Avi Kivity	814942505f	Merge 'Introduce Encryption-at-Rest (EAR) for sstables and commitlog' from Calle Wilund Fixes https://github.com/scylladb/scylla-enterprise/issues/5016#issuecomment-2558464631 EAR - encryption at rest. Allows on-disk file encryption of sstables and commitlog data. Introduces OpenSSL based file level encrypted storage, managed via a set of providers ranging from local files to cloud KMS providers. For a more comprehensive explanation, see the included docs (or if possible, original source tree). Manual bulk merge of EAR feature from enterprise repo to main scylla repo. Breaks some features apart, but main EAR is still a humongous commit, because to separate this I would have to mess with code incrementally, adding time and risk. This PR includes the local file gen tool, tests and also p11 validation. Note: CI will not execute the full tests unless master CI is set to provide the same environment as the enterprise one. Not sure about the status of this ATM. Note: Includes code to compile against cryptsoft kmipc SDK, but not the SDK. If you happen to check out this tree in the scylla folder and configure, it will be linked against and KMIP functionality will be enabled, otherwise not. Closes scylladb/scylladb#22233 * github.com:scylladb/scylladb: docs: Add EAR docs main/build: Add p11-kit and initialize tools: Add local-file-key-generator tool tests: Add EAR tests tmpdir: shorten test tempdir path EAR: port the ear feature from enterprise cql_test_env: Add optional query timeout schema/migration_manager: Add schema validate sstables: add get_shared_components accessor config/config_file: Add exports and definitions of config_type_for<>	2025-01-12 16:10:46 +02:00


1880 Commits