scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 12:36:56 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	5c93e12373	test: util: Introduce ensure_group0_leader_on() Many tests want to assume that group0 leader runs on a particualr server, typically the first server in the list. And they cannot be easily made to work with arbitrary leader, becuase they setup a particular topology and then stop particular nodes, and want to assume the leader is stable. They open leader's log and expect things to appear in that log. It's much easier to ensure the leader, than to prepare tests to handle failovers.	2026-01-18 15:36:07 +01:00
Tomasz Grabiec	478b8f09df	test: tablets: Check that there are no migrations scheduled on draining nodes In case of decommission, it's not desirable because it's less urgent. In case of removenode, it leads to failure of removenode operation because scheduled co-locating migration will fail if the destination is on the excluded node, and this failure will be interpreted as drain failure and coordinator will cancel the request. Not a problem before "parallel decommission" because this failure is only a streaming failure, not a barrier failure, so exception doesn't escape into the catch clause in transition stage handler, and the migration is simply rolled back. Once draining happens in the tablet migration track, streaming failure will be interpreted as drain failure and cancel the request.	2026-01-18 15:36:07 +01:00
Tomasz Grabiec	e082e32cc7	test: lib: topology_builder: Introduce add_draining_request()	2026-01-18 15:36:07 +01:00
Tomasz Grabiec	baea12c9cb	topology_coordinator, tablets: Fail draining operations when tablet migration fails due to critical disk utilization Reaching critical disk utilization on destination means the draining either caused it, or at least works against reliveing it. So it's better to cancel those requests. In case of decommission, if critical disk utilization was caused by it due to not enough capacity, aborting decomission will bring capacity back to the system and rebalancing will relieve critical disk utlization.	2026-01-18 15:36:07 +01:00
Tomasz Grabiec	1b784e98f3	tablets: topology_coordinator: Refactor to propagate reason for migration rollback Will be easier to implement later change to cancel topology request, where we need to give a reason for doing so.	2026-01-18 15:36:07 +01:00
Tomasz Grabiec	2d954f4b19	tablet_allocator: Skip co-location on draining nodes In case of decommission, it's not desirable because it's less urgent. In case of removenode, it leads to failure of removenode operation because scheduled co-locating migration will fail if the destination is on the excluded node, and this failure will be interpreted as drain failure and coordinator will cancel the request. Not a problem before "parallel decommission" because this failure is only a streaming failure, not a barrier failure, so exception doesn't escape into the catch clause in transition stage handler, and the migration is simply rolled back. Once draining happens in the tablet migration track, streaming failure will be interpreted as drain failure and cancel the request.	2026-01-18 15:36:06 +01:00
Tomasz Grabiec	d9e1a6006f	node_ops: task_manager_module: Populate entity field also for active requests	2026-01-18 15:36:06 +01:00
Tomasz Grabiec	bbd293d440	tasks: node_ops: Put node id in the entity field If we have many node requests active at a time, it's useful to know which requets works on which node. Fixes #27208	2026-01-18 15:36:06 +01:00
Tomasz Grabiec	576ebcdd30	tasks, node_ops: Unify setting of task_stats in get_status() and get_stats() They should return the same, so extract the common logic.	2026-01-18 15:36:05 +01:00
Tomasz Grabiec	629d6d98fa	topology: Protect against empty cancelation reason Request would be deemed successful, which is counter to the intention of cancelation and effect on the system.	2026-01-18 15:36:05 +01:00
Tomasz Grabiec	7446eb7e8d	tasks, topology: Make pending node operations abortable We want to be able to cancel decommission when it's still in the tablet draining phase. Such a request is in a pending and paused state, and can be safely canceled. We set the node's "draining" flag back to false.	2026-01-18 15:36:05 +01:00
Tomasz Grabiec	091ed4d54b	doc: topology-over-raft.md: Fix diagram for replacing, tablet_draining is not engaged Since `288e75fe22`	2026-01-18 15:36:05 +01:00
Tomasz Grabiec	a009644c7d	raft_topology, tablets: Drain tablets in parallel with other topology operations Allows other topology operations to execute while tablets are being drained on decommission. In particular, bootstrap on scale-out. This is important for elasticity. Allows multiple decommission/removenode to happen in parallel, which is important for efficiency. Flow of decommission/removenode request: 1) pending and paused, has tablet replicas on target node. Tablet scheduler will start draining tablets. 2) No tablets on target node, request is pending but not paused 3) Request is scheduled, node is in transition 4) Request is done Nodes are considered draining as soon as there is a leave or remove request on them. If there are tablet replicas present on the target node, the request is in a paused state and will not be picked by topology coordinator. The paused state is computed from topology state automatically on reload. When request is not paused, its execution starts in write_both_read_old state. The old tablet_draining state is not entered (it's deprecated now). Tablet load balancing will yield the state machine as soon as some request is no longer paused and ready to be scheduled, based on standard preemption mechanics. The test case test_explicit_tablet_movement_during_decommission is removed. It verifies that tablet move API works during tablet draining transition. After this PR, we no longer enter this transition, so the test doesn't work. It loses its purpose, because movement during normal tablet balancing is not special and tested elsewhere.	2026-01-18 15:36:05 +01:00
Tomasz Grabiec	e38ee160fc	virtual_tables: Show draining and excluded fields in system.cluster_status and system.load_by_node It gives a more accurate picture of what happens in the cluster.	2026-01-18 15:36:04 +01:00
Tomasz Grabiec	1c2e47e059	locator: topology: Add "draining" flag to a node They are being drained of tablet replicas, tablet scheduler works to move replicas away from such nodes. This state is set at the beginning of decommission and removenode operations.	2026-01-18 15:36:04 +01:00
Tomasz Grabiec	a37b1ce832	topology_coordinator: Extract generate_cancel_request_update()	2026-01-18 15:36:04 +01:00
Tomasz Grabiec	77bd00bf9f	storage_service: Drop dependency in topology_state_machine.hh in the header To reduce compilation time.	2026-01-18 15:36:04 +01:00
Tomasz Grabiec	a24c3fc229	locator: Extract common code in assert_rf_rack_valid_keyspace()	2026-01-18 15:36:04 +01:00
Tomasz Grabiec	d3ee82ea51	topology_coordinator, storage_service: Validate node removal/decommission at request submission time After parallel tablet draining, the validation at the time request starts executing is too late, tablets will be already drained. This trips tests which expect validation failure, but get tablet draining failure instead. Also, in case of decommission, it's a waste to go through draining only to discover that the operation has to be rolled back due to validation. So avoid submitting a request altogether if it's invalid. The validation at request execution start remains, for extra sefety. validate_removing_node() was extracted out of topology_coordinator, so that it can be called by storage_service on non-coordinator. Some tests need adjusting for the fact that after failed removenode the node may still not be marked as excluded, so we need to explicitly exclude it or add to the list of ignored nodes in the next removenode operation.	2026-01-18 15:36:04 +01:00
Andrzej Jackowski	6eca7e4ff6	transport: unify lambda capture lifetime for control connections Workload prioritization was added in scylladb/scylladb#22031. The functionality of updating service levels was implemented as a lambda coroutine, leaving room for the lambda coroutine fiasco. The problem was noticed and addressed in scylladb/scylladb#26404. There are currently three functions that call switch_tenant: - update_user_scheduling_group_v1 and update_user_scheduling_group_v2 use the deducing this (this auto self) to ensure the proper lifecycle of the lambda capture. - update_control_connection_scheduling_group doesn’t use the deducing this, but the lambda captures only `this`, which is used before the first possible coroutine preemption. Therefore, it doesn’t seem that any memory corruption or undefined behavior is possible here. Nevertheless, it seems better to start using the deducing this in update_control_connection_scheduling_group as well, to avoid problems in the future if someone modifies the code and forgets to add it. Fixes: SCYLLADB-284 Closes scylladb/scylladb#28158	2026-01-17 20:36:31 +02:00
Nikos Dragazis	8aca7b0eb9	test: database_test: Fix serialization of partition key The `make_key` lambda erroneously allocates a fixed 8-byte buffer (`sizeof(s.size())`) for variable-length strings, potentially causing uninitialized bytes to be included. If such bytes exist and they are not valid UTF-8 characters, deserialization fails: ``` ERROR 2026-01-16 08:18:26,062 [shard 0:main] testlog - snapshot_list_contains_dropped_tables: cql env callback failed, error: exceptions::invalid_request_exception (Exception while binding column p1: marshaling error: Validation failed - non-UTF8 character in a UTF8 string, at byte offset 7) ``` Fixes #28195. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#28197	2026-01-17 20:32:06 +02:00
Botond Dénes	1e09a34686	replica: add abort polling to memtable and cache readers Continuing the read once it is aborted (e.g. due to timeout) is a waste of resources, as the produced results will be discarded. Poll the permit's abort exception in the memtable and cache reader's fill_buffer(). This results in one poll per buffer filled (8KB of data). We already have similar poll for sstable readers, as disk reads are usually much heavier and therefore it is more important to stop them ASAP after abort. Cache and memtable reads are usually quick but not always, hence it is important to also have polling in the cache and memtable readers. Refs: #11469 Fixes: #28148 Closes scylladb/scylladb#28149	2026-01-16 18:03:04 +01:00
Ferenc Szili	0aebc17c4c	docs: correct spelling errors in size based balancing docs `0ede8d154b` introduced the dev doc for size based load balancing, but also added spelling errors. This PR fixed these errors. Closes scylladb/scylladb#28196	2026-01-16 17:41:57 +02:00
Patryk Jędrzejczak	eb7be9010d	Merge 'topology_coordinator: Refresh load stats after table is created or altered' from Tomasz Grabiec We switched to the size-based load balancing, which now has more strict requirements for load stats. We no longer need only per-node stats, but also per-tablet stats. Bootstrapping a node triggers stats refresh, but allocating tablets on table creation didn't. So after creating a table, load balancer couldn't make progress for up to 60s (stats refresh period). This makes tests take longer, and can even cause failures if tests are using a low-enough timeout. Fixes https://github.com/scylladb/scylladb/issues/27921 No backport becuse only master is vulnerable (size-based load balancing). Closes scylladb/scylladb#27926 * https://github.com/scylladb/scylladb: test: cluster: Add reproducer for missed notification in topology coordinator topology_coordinator: Wake up the state machine after stats refresh topology_coordinator: Move tablet_load_stats_refresh_before_rebalancing injection earlier topology_coordinator: Fix potential missed notification topology_coordinator: Refresh load stats after table is created or altered tablets: Do a group0 read barrier on tablet load stats refresh topology_coordinator: Ensure stats are refreshed in the gossip scheduling group test: Use ManagerClient.{disable,enable}_tablet_balancing() test: Add missing calls to disable_tablet_balancing() in tests which use move_tablet() API test: pylib: Introduce ManagerClient.{disable,enable}_tablet_balancing()	2026-01-16 11:34:57 +01:00
Tomasz Grabiec	3fb7719277	topology_coordinator: Update load stats in case rebuilding with no live replica Such rebuild has no read_from replica, but we know the tablet size will be 0. If we don't, stats will be incomplete until the next refresh. This is important for test cases which do removenode or replace while all replicas are down. So for example test_replace from test_tablets_removenode.py, which uses RF=1 and replaces a node. Without this, the test waits for 60s needlessly after the first round of rebuilding migrations before scheduling more migrations. This can cause the test to time out. Fixes #28115 Closes scylladb/scylladb#28121	2026-01-16 11:19:01 +02:00
Sergey Zolotukhin	799d837295	test: disable test_start_bootstrapped_with_invalid_seed The test intermittently fails when an invalid DNS name is resolved, likely due to ISP DNS error hijacking (see scylladb/scylladb#28153). Disable this test to unblock CI. Fixes scylladb/scylladb#28153 Closes scylladb/scylladb#28162	2026-01-15 10:25:45 +01:00
Jenkins Promoter	51d61f809e	Update pgo profiles - aarch64	2026-01-15 05:13:03 +02:00
Jenkins Promoter	eed1e7fa23	Update pgo profiles - x86_64	2026-01-15 04:33:43 +02:00
Tomasz Grabiec	eef798d84f	Merge 'Distribute data evenly among primary replicas during restore' from Robert Bindar Most likely `817fdad` uncovered the fact that our choice of primary replica was resonating with tablet allocation and we were ending up picking the same replica as primary within a scope instead of rotating primaryship among all replicas in the scope. This created situations where for instance, restoring into a 9 nodes with primary_replica_only=true would put all data into 3 nodes, leaving the other 6 unused. The balancing of the dataset was performed by the subsequent repair step. This PR fixes this by changing the formula for picking up the primary replica out of a set of eligible replicas from within the passed scope. The PR also extends the testing scenarios in `test_backup.py` so we get to run restore for a set of topologies, for all combinations of scope, primary_replica_only and min_tablet_counts. Most of the work was done by @bhalevy [here](https://github.com/scylladb/scylladb/compare/master...bhalevy:scylla:load-balance-primary-replica), this PR just splitted it and did touchups here and there. Fixes #27281 Closes scylladb/scylladb#27397 * github.com:scylladb/scylladb: test: reduce dataset and number of test cases or debug builds test: bump repair timeout up, it's sometimes not enough in CI test: refactor test_refresh.py to match test_restore_with_streaming_scopes. test: extend test_restore_with_streaming_scopes test: Adjust test_restore_primary_replica_different_dc_scope_all test: Refactor restoring code in test_backup to match SM pattern test: add check_mutation_replicas calls after fresh creation of dataset test: extend create_dataset to accept consistency_level test: refactor check_mutation_replicas so it's more readable test: make create_dataset async and refactor so it's configurable test: use defaultdict in collect_mutations test: add log marks to facilitate reusing server for restore locator: tablets: Distribute data evenly among primary replicas during restore	2026-01-14 18:57:55 +01:00
Avi Kivity	bd08b6e5b2	Merge 'Unify configuration of object storage endpoints (take 2)' from Pavel Emelyanov To configure S3 storage, one needs to do ``` object_storage_endpoints: - name: s3.us-east-1.amazonaws.com port: 443 https: true aws_region: us-east-1 ``` and for GCS it's ``` object_storage_endpoints: - name: https://storage.googleapis.com:433 type: gs credentials_file: <gcp account credentials json file> ``` This PR updates the S3 part to look like ``` object_storage_endpoints: - name: https://s3.us-east-1.amazonaws.com:443 aws_region: us-east-1 ``` fixes: #26570 This is 2nd attempt, previous one (#27360) was reverted because it reported endpoint configs in new format via API and CQL always, even if the endpoint was configured in the old way. This "broke" scylla manager and some dtests. This version has this bug fixed, and endpoints are reported in the same format as they were configured with. About correctness of the changes. No modifications to existing tests are made here, so old format is respected correctly (as far as it's covered by tests). To prove the new format works the the test_get_object_store_endpoints is extended to validate both options. Some preparations to this test to make this happen come on their own with the PR #28111 to show that they are valid and pass before changing the core code. Enhancing the way configuration is made, likely no need to backport. Closes scylladb/scylladb#28112 * github.com:scylladb/scylladb: test: Validate S3 endpoints new format works docs: Update docs according to new endpoints config option format object_storage: Create s3 client with "extended" endpoint name s3/storage: Tune config updating sstable: Shuffle args for s3_client_wrapper test: Rename badconf variable into objconf test: Split the object_store/test_get_object_store_endpoints test	2026-01-14 18:29:03 +02:00
Yaniv Michael Kaul	d919aacc69	storage_proxy: mark write_timeouts metric for counter write timeouts When a counter write times out (due to rpc::timeout_error or timed_out_error), the code was throwing mutation_write_timeout_exception but not marking the write_timeouts metric. This resulted in counter write timeouts not being counted in the scylla_storage_proxy_coordinator_write_timeouts metric. Regular writes go through mutate_internal -> mutate_end, which catches mutation_write_timeout_exception and marks the metric. However, counter writes use a separate code path (mutate_counters) that has its own exception handling but was missing the metric update. This fix adds get_stats().write_timeouts.mark() before throwing the timeout exception in the counter write path, consistent with how the CAS path handles cas_write_timeouts. Refs: https://scylladb.atlassian.net/browse/SCYLLADB-245 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Closes scylladb/scylladb#28019	2026-01-14 17:50:46 +02:00
Gleb Natapov	bee5f63cb6	topology coordinator: complete pending operation for a replaced node A replaced node may have pending operation on it. The replace operation will move the node into the 'left' state and the request will never be completed. More over the code does not expect left node to have a request. It will try to process the request and will crash because the node for the request will not be found. The patch checks is the replaced node has peening request and completes it with failure. It also changes topology loading code to skip requests for nodes that are in a left state. This is not strictly needed, but makes the code more robust. Fixes #27990 Closes scylladb/scylladb#28009	2026-01-14 13:11:27 +01:00
Botond Dénes	551eecab63	Merge 'EAR: deprecate the replicated key provider' from Calle Wilund Refs #22733. Adds runtime warning and docs info that replicated provider is deprecated and will be removed. Fixes #27292 Closes scylladb/scylladb#27270 * github.com:scylladb/scylladb: docs::encryption: Add warning that replicated provider is deprecated ent::encryption: Switch default key provider from replicated to local replicated_key_provider: Add deprecation warning on usage	2026-01-14 13:47:23 +02:00
Patryk Jędrzejczak	6b5923c64e	test: test_group0_schema_versioning: wait for schema sync in system.local `test_schema_versioning_with_recovery` is currently flaky. It performs a write with CL=ALL and then checks if the schema version is the same on all nodes by calling `verify_table_versions_synced`. All nodes are expected to sync their schema before handling the replica write. The node in RECOVERY mode should do it through a schema pull, and other nodes should do it through a group 0 read barrier. The problem is in `verify_local_schema_versions_synced` that compares the schema versions in `system.local`. The node in RECOVERY mode updates the schema version in `system.local` after it acknowledges the replica write as completed. Hence, the check can fail. We fix the problem by making the function wait until the schema versions match. Note that RECOVERY mode is about to be retired together with the whole gossip-based topology in 2026.2. So, this test is about to be deleted. However, we still want to fix it, so that it doesn't bother us in older branches. Fixes #23803 Closes scylladb/scylladb#28114	2026-01-14 09:55:45 +01:00
Jakub Smolar	aefd815194	test.py: add pexpect to the dependencies Use pexpect to control a presistent GDB process with pattern reads and timeouts. This makes 'scylla_gdb' tests faster and less flaky. Added python3-pexpect in 'install-dependencies.sh'. Closes scylladb/scylladb#26419 [avi: build optimized clang 21.1.8 regenerated frozen toolchain with optimized clang from https://devpkg.scylladb.com/clang/clang-21.1.8-Fedora-43-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-21.1.8-Fedora-43-x86_64.tar.gz ] Closes scylladb/scylladb#28134	2026-01-14 10:17:37 +02:00
Botond Dénes	122b7847e5	Merge 'index: Accept view properties in CREATE INDEX' from Dawid Mędrek Problem ------- Secondary indexes are implemented via materialized views under the hood. The way an index behaves is determined by the configuration of the view. Currently, it can be modified by performing the CQL statement `ALTER MATERIALIZED VIEW` on it. However, that raises some concerns. Consider, for instance, the following scenario: 1. The user creates a secondary index on a table. 2. In parallel, the user performs writes to the base table. 3. The user modifies the underlying materialized view, e.g. by setting the `synchronous_updates` to `true` [1]. Some of the writes that happened before step 3 used the default value of the property (which is `false`). That had an actual consequence on what happened later on: the view updates were performed asynchronously. Only after step 3 had finished did it change. Unfortunately, as of now, there is no way to avoid a situation like that. Whenever the user wants to configure a secondary index they're creating, they need to do it in another schema change. Since it's not always possible to control how the database is manipulated in the meantime, it leads to problems like the one described. That's not all, though. The fact that it's not possible to configure secondary indexes is inconsistent with other schema entities. When it comes to tables or materialized views, the user always have a means to set some or even all of the properties during their creation. Solution -------- The solution to this problem is extending the `CREATE INDEX` CQL statement by view properties. The syntax is of form: ``` > CREATE INDEX <index name> > .. ON <keyspace>.<table> (<columns>) > .. WITH <properties> ``` where `<properties>` corresponds to both index-specific and view properties [2, 3]. View properties can only be used with indexes implemented with materialized views; for example, it will be impossible to create a vector index when specifying any view property (see examples below). When a view property is provided, it will be applied when creating the underlying materialized view. The behavior should be similar to how other CQL statements responsible for creating schema entities work. High-level implementation strategy ---------------------------------- 1. Make auxiliary changes. 2. Introduce data structures representing the new set of index properties: both index-specific and those corresponding to the underlying view. 3. Extend `CREATE INDEX` to accept view properties. 4. Extend `DESCRIBE INDEX` and other `DESCRIBE` statements to include view properties in their output. User documentation is also updated at the steps to reflect the corresponding changes. Implementation considerations ----------------------------- There are a number of schema properties that are now obsolete. They're accepted by other CQL statements, but they have no effect. They include: * `index_interval` * `replicate_on_write` * `populate_io_cache_on_flush` * `read_repair_chance` * `dclocal_read_repair_chance` If the user tries to create a secondary index specifying any of those keywords, the statement will fail with an appropriate error (see examples below). Unlike materialized views, we forbid specifying the clustering order when creating a secondary index [4]. This limitation may be lifted later on, but it's a detail that may or may not prove troublesome. It's better to postpone covering it to when we have a better perspective on the consequences it would bring. Examples -------- Good examples ``` > CREATE INDEX idx ON ks.t (v); > CREATE INDEX idx ON ks.t (v) WITH comment = 'ok view property'; > CREATE INDEX idx ON ks.t (v) .. WITH comment = 'multiple view properties are ok' .. AND synchronous_updates = true; > CREATE INDEX idx ON ks.t (v) .. WITH comment = 'default value ok' .. AND synchronous_updates = false; ``` Bad examples ``` > CREATE INDEX idx ON ks.t (v) WITH replicate_on_write = true; SyntaxException: Unknown property 'replicate_on_write' > CREATE INDEX idx ON ks.t (v) .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="Cannot specify options for a non-CUSTOM index" > CREATE CUSTOM INDEX idx ON ks.t (v) .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="CUSTOM index requires specifying the index class" > CREATE CUSTOM INDEX idx ON ks.t (v) .. USING 'vector_index' .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="You cannot use view properties with a vector index" > CREATE INDEX idx ON ks.t (v) WITH CLUSTERING ORDER BY (v ASC); InvalidRequest: Error from server: code=2200 [Invalid query] message="Indexes do not allow for specifying the clustering order" ``` and so on. For more examples, see the relevant tests. References: [1] https://docs.scylladb.com/manual/branch-2025.4/cql/cql-extensions.html#synchronous-materialized-views [2] https://docs.scylladb.com/manual/branch-2025.4/cql/secondary-indexes.html#create-index [3] https://docs.scylladb.com/manual/branch-2025.4/cql/mv.html#mv-options [4] https://docs.scylladb.com/manual/branch-2025.4/cql/dml/select.html#ordering-clause Fixes scylladb/scylladb#16454 Backport: not needed. This is an enhancement. Closes scylladb/scylladb#24977 * github.com:scylladb/scylladb: cql3: Extend DESC INDEX by view properties cql3: Forbid using CLUSTERING ORDER BY when creating index cql3: Extend CREATE INDEX by MV properties cql3/statements/create_index_statement: Allow for view options cql3/statements/create_index_statement: Rename member cql3/statements/index_prop_defs: Re-introduce index_prop_defs cql3/statements/property_definitions: Add extract_property() cql3/statements/index_prop_defs.cc: Add namespace cql3/statements/index_prop_defs.hh: Rename type cql3/statements/view_prop_defs.cc: Move validation logic into file cql3/statements: Introduce view_prop_defs.{hh,cc} cql3/statements/create_view_statement.cc: Move validation of ID schema/schema.hh: Do not include index_prop_defs.hh	2026-01-14 09:54:27 +02:00
Pavel Emelyanov	e57ee84662	util: Re-use seastar::util::memory_data_sink A data_sink that stores buffers into an in-memory collection had appeared in seastar recently. In Scylla there's similar thing that uses memory_data_sink_buffer as a container, so it's possible to drop the data_sink_impl iself in favor of seastar implementation. For that to work there should be append_buffers() overload for the aforementioned container. For its nice implementation the container, in turn, needs to get push_back() method and value_type trait. The method already exists, but is called put(), so just rename it. There's one more user of it this method in S3 client, and it can enjoy the added append_buffers() helper. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#28124	2026-01-14 08:54:00 +02:00
Avi Kivity	3fc5a32136	tools: toolchain: update instructions for building optimized clang with version information The instructions for building optimized clang neglected to mention that the clang version to be built must be specified. Correct that. Closes scylladb/scylladb#28135	2026-01-14 06:46:20 +02:00
Botond Dénes	6e17bf5c1a	tools/scylla-nodetool: migrate to std::localtime fmt::localtime() is now deprecated, users should migrate to equivalents from the standard libraries. std::localtime is not thread safe, so a local wrapper is introduced, based on the thread-safe localtime_r() (from libc). Closes scylladb/scylladb#27821	2026-01-13 20:46:31 +02:00
Avi Kivity	489d1a0fbc	Merge 'replica: don't throw exceptions for read timeout' from Botond Dénes Read timeouts are a common occurence and they typically occur when the replica is overloaded. So throwing exceptions for read timeouts is very harmful. Be careful not to thow exceptions while propagating them up the future chain. Add a test to enfore and detect regressions. Fixes: scylladb/scylladb#25062 Improvement, normally not a backport candidate, but we may decide to backport if customer(s) are found to suffer from this. Closes scylladb/scylladb#25068 * github.com:scylladb/scylladb: reader_permit: remove check_abort() test/boost/database_test: add test for read timeout exceptions sstables/mx/reader: don't throw exceptions on the read-path readers/multishard: don't throw exceptions on the read-path replica/table: don't throw exceptions on the read-path multishard_mutation_query: fix indentation multishard_mutation_query: don't throw exceptions on the read-path service/storage_proxy: don't throw exceptions on the full-scan path cql3/query_processor: don't throw exceptions on the read-path reader_permit: add get_abort_exception()	2026-01-13 16:17:41 +02:00
Avi Kivity	c6dfae5661	treewide: #include Seastar headers with angle brackets Seastar is an external library from the point of view of ScyllaDB, so should be included with angle brackets. Closes scylladb/scylladb#27947	2026-01-13 14:56:15 +02:00
Tomasz Grabiec	63b9a7e2b5	test: pylib: log_browsing: Grep logs without considering newly appended lines At the end of the test case, the framework greps logs for errors and backtraces. The servers are still running at this point. Some test cases enable debug-level logging. If servers manage to produce new lines between the python script processes them, the grep will never return. Protect against this by grepping over a file snapshot. Fixes #28086 Closes scylladb/scylladb#28088	2026-01-13 14:41:02 +02:00
Nadav Har'El	fc6fff61d1	docs/alternator: add document on reducing Alternator network costs This patch adds a new document, docs/alternator/network.md, explaining the various mechanisms that can be used to reduce network usage in Alternator. It explains compression of requests and responses, header reduction, rack-aware routing, and RPC compression. Many of these topics - especially support in the client libraries - are work in progress, so some details are still missing in the new document. Still, I think it is a good start that can be improved later. Fixes #27915. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27927	2026-01-13 14:29:01 +02:00
Pavel Emelyanov	9ffd22491f	test: Validate S3 endpoints new format works Extend the test_get_object_store_endpoints() test to configure S3 endpoints in full-url format and check that they are rendered properly via API/CQL. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:18 +03:00
Pavel Emelyanov	bd225784bd	docs: Update docs according to new endpoints config option format Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:06 +03:00
Pavel Emelyanov	f227de24b2	object_storage: Create s3 client with "extended" endpoint name For this, add the s3::client::make(endpoint, ...) overload that accepts endpoint in proto://host:port format. Then it parses the provided url and calls the legacy one, that accepts raw host string and config with port, https bit, etc. The generic object_storage_endpoint_param no longer needs to carry the internal s3::endpoint_config, the config option parsing changes respectively. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:06 +03:00
Pavel Emelyanov	8f97e6b3de	s3/storage: Tune config updating Don't prepare s3::endpoint_config from generic code, jut pass the region and iam_role_arn (those that can potentially change) to the callback. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:06 +03:00
Pavel Emelyanov	bee3564564	sstable: Shuffle args for s3_client_wrapper Make it construct like gs_client_wrapper -- with generic endpoint param reference and make the storage-specific casts/gets/whatever internally. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:06 +03:00
Pavel Emelyanov	83e88d206c	test: Rename badconf variable into objconf It's not actually a "bad" config, it's just some config the test works with. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:23:20 +03:00
Pavel Emelyanov	9c627bc44a	test: Split the object_store/test_get_object_store_endpoints test It tests two things -- the way object storage config is represented via API and CQL (from sytem.config) and that updating config affects CREATE KEYSPACE CQL (with keyspace storage options) It's better to split the test, as its former part is going to be extented to validate old/new config formats (see #26570) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:23:03 +03:00

1 2 3 4 5 ...

51514 Commits