scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-03 13:37:04 +00:00

Author	SHA1	Message	Date
Michał Jadwiszczak	7dfb76f9a7	db/view/view_building_worker: wrap `shared_sstable` in `foreign_ptr` When a staging sstable is registered to the view building worker, it needs to make a round trip from its original shard to shard 0 (in order to create a view building task) and back (to be eventually processed). Until now this was done using plain `sstables::shared_sstable` (= `lw_shared_ptr`) which is not safe to be moved between shards. This patch fixes this by wrapping the pointer in `foreign_ptr` and obtaining the necessary information (owner shard, last token) on the original shard (instead of on shard0). Then all of those objects are put into a freshly introduced structure `staging_sstable_task_info`, which can be safely moved between shards. Fixes scylladb/scylladb#25859	2025-09-18 02:57:36 +02:00
Michał Jadwiszczak	50678030c0	db/view/view_building_worker: use table id in `register_staging_sstable_tasks()` There is no need to pass the pointer only to get id of the table.	2025-09-18 02:57:35 +02:00
Michał Jadwiszczak	b44c223d47	db/view/view_building_worker: move helper functions higher So they can be used in `view_building_worker::register_staging_sstable_tasks()`.	2025-09-18 02:57:35 +02:00
Piotr Smaron	bdb90ee15c	set ssl_* columns in system.clients Depends on https://github.com/scylladb/seastar/pull/2651 Missing columns have been present since probably forever - they were added to the schema but never assigned any value: ``` cqlsh> select * from system.clients; ------------------+------------------------ ... ssl_cipher_suite \| null ssl_enabled \| null ssl_protocol \| null ... ``` This patch sets values of these columns: - with a TLS connection, the 3 TLS-related fields are filled in, - without TLS, `ssl_enabled` is set to `false` and other columns are `null`, - if there's an error while inspecting TLS values, the connection is dropped. We want to save the TLS info of a connection just after accepting it, but without waiting for a TLS handshake to complete, so once the connection is accepted, we're inspecting it in the background for the server to be able to accept next connections immediately. Later, when we construct system.clients virtual table, the previously saved data can be instantaneously assigned to client_data, which is a struct representing a row in system.clients table. This way we don't slow down constructing this table by more than necessary, which is relevant for cases with plenty of connections. Fixes: #9216 Closes scylladb/scylladb#22961	2025-09-17 16:29:55 +03:00
Nadav Har'El	3c0032deb4	alternator: fix bug in combination of AttributeUpdates + ReturnValues In test/alternator/test_returnvalues.py we had tests for the ReturnValues feature on UpdateItem requests - but we only tested UpdateItem requests with the "modern" UpdateExpression, and forgot to test the combination of ReturnValues with the old AttributeUpdates API. It turns out this combination is buggy: when both ReturnValues=ALL_OLD and AttributeUpdates need the previous value of the item, we may wrongly std::move() the value out, and the operation will fail with a strange error: An error occurred (ValidationException) when calling the UpdateItem operation: JSON assert failed on condition 'IsObject()' The fix in this patch is trivial - just move the std::move() to the correct place, after both UpdateExpression and AttributeUpdates handling is done. This patch also includes a reproducing test, which fails before this patch and passes with it - and of course passes on DynamoDB. This test reproduces two cases where the bug happened, as well as one case where it didn't (to make sure we don't regress in what already worked). Fixes #25894 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#25900	2025-09-17 16:04:01 +03:00
Piotr Dulikowski	6a90a1fd29	Merge 'db/view/view_building_worker: split batch's data preparation and execution' from Michał Jadwiszczak The view building batch lives on shard0 but it might be doing work on shard which owns the tablet replica. Until now the batch data was accessed from multiple shards (shard0 and where the batch was executed). This patch fixes this by splitting tasks execution into: - preparation which is always happening on shard0 - actual execution of the tasks on relevant shard, but all necessary data is copied to the shard and batch object isn't accessed. Fixes https://github.com/scylladb/scylladb/issues/25804 View building coordinator hasn't been released yet, so no backport needed. Closes scylladb/scylladb#26058 * github.com:scylladb/scylladb: db/view/view_building_worker: move try-catch outside `invoke_on()` db/view/view_building_worker: split batch's data preparation and execution	2025-09-17 14:17:25 +02:00
Botond Dénes	30a3f61fa0	Merge 'compaction: handle exception in expected_total_workload' from Aleksandra Martyniuk expected_total_workload methods of scrub compaction tasks create a vector of table_info based on table names. If any table was already dropped, then the exception is thrown. It leaves table_info in corrupted state and node crashes with `free(): invalid size`. Return std::nullopt if an exception was thrown to indicate that total workload cannot be found. Fixes: #25941. No release branches affected Closes scylladb/scylladb#25944 * github.com:scylladb/scylladb: tasks: get progress of failed task based on children compaction: handle exception in expected_total_workload	2025-09-17 15:10:19 +03:00
Nadav Har'El	e322902506	Merge 'index, metrics: add per-index metrics' from Michał Hudobski This patch adds the possibility to track metrics per secondary index. Currently, only a histogram of query latencies is tracked, but more metrics can be added in the future. To add a new metric, it needs to be added to the index_metrics struct in index/secondary_index_manager.hh and then initialized in index/secondary_index_manager.cc in the constructor of the index_metrics struct. The metrics are created when the index is created and removed when the index is dropped. First lines of the new metric: \# HELP scylla_index_query_latencies Index query latencies \# TYPE scylla_index_query_latencies histogram scylla_index_query_latencies_sum{idx="test_i_idx",ks="test"} 640 scylla_index_query_latencies_count{idx="test_i_idx",ks="test"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="640.000000"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="768.000000"} 1 Fixes: https://github.com/scylladb/scylladb/issues/25970 Closes scylladb/scylladb#25995 * github.com:scylladb/scylladb: test: verify that the index metric is added index, metrics: add per-index metrics	2025-09-17 14:54:12 +03:00
Szymon Malewski	776f90e2f8	alternator/expressions.g: Fix antlr3 missing token leak This patch overrides the antlr3 function that allocates the missing tokens that would eventually leak. The override stores these tokens in a vector, ensuring memory is freed whenever the parser is destroyed. Solution is copied from CQL implementation. A unit test to reproduce the issue is added - leak would be reported by ASAN, when running this test in debug mode - the test passed but the leak is discovered when the test file exits. Fixes #25878 Closes scylladb/scylladb#25930	2025-09-17 13:05:24 +03:00
Benny Halevy	3a6208b319	utils: stall_free: clear_gently: release wrapped objects As discussed in https://github.com/scylladb/scylladb/pull/24606#discussion_r2281870939 clear_gently of shared pointers should release the wrapped object reference and when the object's use_count reaches 1, the object itself would be cleared_gently, before it's destroyed. This behavior is similar to the way we clear gently containers like arrays or vectors, and so it is extended in this patch to smart pointers like unique_ptr and foreign_ptr. The unit tests are adjusted respectively to expect the smart pointers to be reset after clear_gently, plus the use of `reset()` for `foreign_ptr<shared_ptr<>>` was replaced by `clear_gently().get()` which now ensures the reference to a shared object is released, and awaited for, if it happens on a foreign owner shard, unlike reset of a foreign_ptr that kicks off destroy of that shared object in the background on the owner shard - causing flakiness. Fixes #25723 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#25759	2025-09-17 11:44:26 +03:00
Patryk Jędrzejczak	454eb08cb4	Merge 'group0: remove obsolete "stop_before_becoming_raft_voter" error injection' from Emil Maskovsky The Raft topology workflow was changed by the limited voters feature: nodes no longer request votership themselves. As a result, the "stop_before_becoming_raft_voter" error injection is now obsolete and has been removed. Fixes: scylladb/scylladb#23418 No backport: This re-enables a test, only needed for master. Closes scylladb/scylladb#26042 * https://github.com/scylladb/scylladb: group0: remove obsolete "stop_before_becoming_raft_voter" error injection test/random_failures: preserve test repeatability when removing error injections	2025-09-17 10:38:32 +02:00
Michał Jadwiszczak	d98237b33c	db/view/view_building_worker: move try-catch outside `invoke_on()` It's just a stylistic change; to me, doing `invoke_on()` in a try-catch block looks better than the other way around.	2025-09-16 23:15:44 +02:00
Michał Jadwiszczak	9458ceff8f	db/view/view_building_worker: split batch's data preparation and execution The view building batch lives on shard0 but it might be doing work on shard which owns the tablet replica. Until now the batch data was accessed from multiple shards (shard0 and where the batch was executed). This patch fixes this by splitting tasks execution into: - preparation which is always happening on shard0 - actual execution of the tasks on relevant shard, but all necessary data is copied to the shard and batch object isn't accessed. Fixes scylladb/scylladb#25804	2025-09-16 23:13:36 +02:00
Patryk Jędrzejczak	368d70ee15	Merge 'LWT: implement fencing' from Petr Gusev This PR consists of three parts: * Small refactoring of the fencing APIs in storage_proxy (renames + comments + some functions were extracted) * Implement the fencing for LWT verbs itself. This includes checking the fencing token before and after local replica data accesses. * Two new `test.py` tests in `test_fencing.py`, which check the fencing in some real-world scenarios. Backport: no need -- fencing for LWT requests is needed primarily for LWT over tablets, which is not released yet. Fixes scylladb/scylladb#22332 Closes scylladb/scylladb#25550 * https://github.com/scylladb/scylladb: test_tablets_lwt: eliminate redundant disable_tablet_balancing test_fencing: add test_lwt_fencing_upgrade pylib: extract upgrade helpers from test_sstable_compression_dictionaries_upgrade.py test_fencing: add test_fenced_out_on_tablet_migration_while_handling_paxos_verb test_fencing: test_fence_lwt_during_bootstap pylib/rest_client.py: encode injection name storage_proxy_stats: add fenced_out_requests metric storage_proxy: add fencing to Paxos verbs storage_proxy::apply_fence: add overload that throws on failure storage_proxy: extract apply_fence_result sp::apply_fence: rename to apply_fence_on_ready sp::apply_fence: rename to check_fence sp::apply_fence: make non-generic	2025-09-16 23:40:48 +03:00
Ernest Zaslavsky	d624413ddd	treewide: Move query related files to a new `query` directory As requested in #22120, moved the files and fixed other includes and build system. Moved files: - query.cc - query-request.hh - query-result.hh - query-result-reader.hh - query-result-set.cc - query-result-set.hh - query-result-writer.hh - query_id.hh - query_result_merger.hh Fixes: #22120 This is a cleanup, no need to backport Closes scylladb/scylladb#25105	2025-09-16 23:40:47 +03:00
Pavel Emelyanov	6fb66b796a	s3: Add metrics to show S3 prefetch bytes The chunked download source sends large GET requests and then consumes data as it arrives. Sometimes it can stop reading from socket early and drop the in-flight data. The existing read-bytes metrics show only the number of consumed bytes, but we also want to know the number of requested bytes. Refs #25770 (accounting of read-bytes) Fixes #25876 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#25877	2025-09-16 23:40:47 +03:00
Sergey Zolotukhin	2640b288c2	raft: disable caching for raft log. This change disables caching for raft log table due to the following reasons: * Immediate reason is a deficiency in handling emerging range tombstones in the cache, which causes stalls. * Long-term reason is that sequential reads from the raft log do not benefit from the cache, making it better to bypass it to free up space and avoid stalls. Fixes scylladb/scylladb#26027 Closes scylladb/scylladb#26031	2025-09-16 23:40:47 +03:00
Pavel Emelyanov	d69a51f42a	compaction: Use function when filtering compaction tasks for stopping The compaction_manager::stop_compaction() method internally walks the list of tasks and compares each task's compacting_table (which is compaction group view pointer) with the given one. In case this stop_compaction() method is called via API for a specific table, the method walks the list of tasks for every compaction group from the table, thus resulting in nr_groups * nr_tasks complexity. Not terrible, but not nice either. The proposal is to pass filtering function into the inner do_stop_ongoing_compactions() method. Some users will pass a simple "return true" lambda, but those that need to stop compactions for a specific table (e.g. -- the API handler) will effectively walk the list of tasks once comparing the given compaction group's schema with the target table one (spoiler: eventually this place will also be simplified not to mess with replica::table at all). One ugliness with the change is the way "scope" for logging message is collected. If all tasks belong to the same table, then "for table ..." is printed in logs. With the change the scope is no longer known instantly and is evaluated dynamically while walking the list of tasks. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#25846	2025-09-16 23:40:47 +03:00
Michał Chojnowski	68e6141211	scylla-gdb: add `scylla prepared-statements` Add a helper which prints all prepared statements currently present in the query processor. Example output: ``` (gdb) scylla prepared-statements (cql3::cql_statement*)(0x600003d71050): SELECT * FROM ks.ks WHERE pk = ? (cql3::cql_statement*)(0x600003972b50): SELECT pk FROM ks.ks WHERE pk = ? ``` Closes scylladb/scylladb#26007	2025-09-16 23:40:47 +03:00
Botond Dénes	0cf6a648bb	Merge 'Default create keyspace syntax' from Dario Mirovic Allow for the following CQL syntax: ``` CREATE KEYSPACE [IF NOT EXISTS] <name>; ``` for example: ``` CREATE KEYSPACE test_keyspace; ``` With this syntax all the keyspace's parameters would be defaulted to: replication strategy = `NetworkTopologyStrategy`, replication factor = number of racks , but excluding racks that only have arbiter nodes storage options, durable writes = defaults we normally would use, tablets enabled if they are enabled in the db configuration, e.g. scylla.yaml or db/config.cc by default. Options besides `replication` already have defaults. `replication` had to be specified, but it could be an empty set, where defaults for sub-options (replication strategy and replication factor) would be used - `replication = {}`. Now there is no need for specifying an empty set - omitting `replication = {}` has the same effect as `replication = {}`. Since all the options now have defaults, `WITH` is optional for `CREATE KEYSPACE` statement. Fixes #25145 This is an improvement, no backport needed. Closes scylladb/scylladb#25872 * github.com:scylladb/scylladb: docs: cql: default create keyspace syntax test: cqlpy: add test for create keyspace with no options specified cql: default `CREATE KEYSPACE` syntax	2025-09-16 23:40:47 +03:00
Emil Maskovsky	943af1ef1c	topology_coordinator: consistently rethrow `raft::request_aborted` for direct/global commands Ensure all direct and global topology commands rethrow the `raft::request_aborted` exception when aborted, typically due to leadership changes. This makes abortion explicit to callers, enabling proper handling such as retries or workflow termination. This change completes the work started in PR scylladb/scylladb#23962, covering all remaining cases where the exception was not rethrown. Fixes: scylladb/scylladb#23589 No backport: No related issues observed in previous versions; backport not required. Closes scylladb/scylladb#26021	2025-09-16 23:40:47 +03:00
Emil Maskovsky	87bd328873	group0: remove obsolete "stop_before_becoming_raft_voter" error injection The Raft topology workflow was changed by the limited voters feature: nodes no longer request votership themselves. As a result, the "stop_before_becoming_raft_voter" error injection is now obsolete and has been removed. Fixes: scylladb/scylladb#23418	2025-09-16 18:24:27 +02:00
Emil Maskovsky	0453052d66	test/random_failures: preserve test repeatability when removing error injections The order of entries in the ERROR_INJECTIONS list determines test repeatability for a given random seed. To allow removing error injections without affecting the order of the remaining ones, removed injections are now renamed with a "REMOVED_" prefix instead of being deleted. This ensures they are ignored by the tests, while the sequence of active injections—and thus test reproducibility—remains unchanged.	2025-09-16 18:22:45 +02:00
Michał Hudobski	3364cc96f5	test: verify that the index metric is added This commit adds a test that performs a sanity check that the implemented metric is actually being added to Scylla's metrics and has the correct value.	2025-09-16 18:10:01 +02:00
Aleksandra Martyniuk	3324f08e9c	tasks: get progress of failed task based on children Currently, for failed tasks task_manager::task::impl::get_progress attempts to find expected_total_workload. However, if the task finished a long time ago, the state might have totally changed, e.g. some tables might have been dropped or have changed their sizes. Due to that, the result of expected_total_workload might be irrelevant. Count the progress of a finished task based on children only, regardless of whether the task has succeeded or failed.	2025-09-16 17:15:01 +02:00
Michał Hudobski	b09d1f0a98	index, metrics: add per-index metrics This patch adds the possibility to track metrics per secondary index. Currently, only a histogram of query latencies is tracked, but more metrics can be added in the future. To add a new metric, it needs to be added to the index_metrics struct in index/secondary_index_manager.hh and then initialized in index/secondary_index_manager.cc in the constructor of the index_metrics struct. The metrics are created when the index is created and removed when the index is dropped. First lines of the new metric: \# HELP scylla_index_query_latencies Index query latencies \# TYPE scylla_index_query_latencies histogram scylla_index_query_latencies_sum{idx="test_i_idx",ks="test"} 640 scylla_index_query_latencies_count{idx="test_i_idx",ks="test"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="640.000000"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="768.000000"} 1	2025-09-16 14:03:43 +02:00
Patryk Jędrzejczak	9efe250a8f	Merge 'gossiper: ensure gossiper operations are executed in gossiper scheduling group' from Sergey Zolotukhin Sometimes gossiper operations invoked from storage_service and other components run under a non-gossiper scheduling group. If these operations acquire gossiper locks, priority inversion can occur: higher-priority gossiper tasks may wait behind lower-priority tasks (e.g. streaming), which can cause gossiper slowness or even failures. This patch ensures that gossiper operations requiring locks on gossiper structures are explicitly executed in the gossiper scheduling group. To help detect similar issues in the future, a warning is logged whenever a gossiper lock is acquired under a non-gossiper scheduling group. Fixes scylladb/scylladb#25907 Refs: scylladb/scylladb#25702 Backport: this patch fixes an issue with gossiper operations scheduling group, that might affect topology operations, therefore backport is needed to 2025.1, 2025.2, 2025.3 Closes scylladb/scylladb#25981 * https://github.com/scylladb/scylladb: gossiper: ensure gossiper operations are executed in gossiper scheduling group gossiper: fix wrong gossiper instance used in `force_remove_endpoint`	2025-09-16 10:14:15 +02:00
Asias He	54162a026f	scylla-nodetool: Add --incremental-mode option to cluster repair The `--incremental-mode` option specifies the incremental repair mode. Can be 'disabled', 'regular', or 'full'. 'regular': The incremental repair logic is enabled. Unrepaired sstables will be included for repair. Repaired sstables will be skipped. The incremental repair states will be updated after repair. 'full': The incremental repair logic is enabled. Both repaired and unrepaired sstables will be included for repair. The incremental repair states will be updated after repair. 'disabled': The incremental repair logic is disabled completely. The incremental repair states, e.g., repaired_at in sstables and sstables_repaired_at in the system.tablets table, will not be updated after repair. When the option is not provided, it defaults to regular. Fixes #25931 Closes scylladb/scylladb#25969	2025-09-16 10:23:22 +03:00
Botond Dénes	ee7c85919e	Revert "treewide: seastar module update and fix broken rest client" This reverts commit `44d34663bc` of PR https://github.com/scylladb/scylladb/pull/25915. Breaks artifact tests on ARM, blocking us from building new images from master.	2025-09-16 08:31:08 +03:00
Sergey Zolotukhin	6c2a145f6c	gossiper: ensure gossiper operations are executed in gossiper scheduling group Sometimes gossiper operations invoked from storage_service and other components run under a non-gossiper scheduling group. If these operations acquire gossiper locks, priority inversion can occur: higher-priority gossiper tasks may wait behind lower-priority tasks (e.g. streaming), which can cause gossiper slowness or even failures. This patch ensures that gossiper operations requiring locks on gossiper structures are explicitly executed in the gossiper scheduling group. To help detect similar issues in the future, a warning is logged whenever a gossiper lock is acquired under a non-gossiper scheduling group. Fixes scylladb/scylladb#25907	2025-09-15 12:49:07 +02:00
Petr Gusev	1d270020f2	test_tablets_lwt: eliminate redundant disable_tablet_balancing This is a refactoring commit.	2025-09-15 12:40:10 +02:00
Petr Gusev	7060265d5f	test_fencing: add test_lwt_fencing_upgrade This test verifies that upgrading to a Scylla version with LWT fencing does not disrupt existing LWT workloads.	2025-09-15 12:34:45 +02:00
Petr Gusev	49b036cf2b	pylib: extract upgrade helpers from test_sstable_compression_dictionaries_upgrade.py We want to reuse them to test upgrade for LWT fencing.	2025-09-15 12:34:45 +02:00
Petr Gusev	82f0235e4b	test_fencing: add test_fenced_out_on_tablet_migration_while_handling_paxos_verb This test verifies that the fencing token is checked on replicas after the local Paxos state is updated. This ensures that if we failed to drain an LWT request during topology changes, the replicas where Paxos verbs got stuck won't contribute to the target CLs.	2025-09-15 12:34:45 +02:00
Petr Gusev	0156850605	test_fencing: test_fence_lwt_during_bootstap	2025-09-15 12:09:08 +02:00
Dawid Mędrek	18cb748268	docs/snitch: Document default DC and rack The existing article is already extensive and covers pretty much all of the details useful to the user. However, the document lacked minute information like the default names of the DC and rack in case of SimpleSnitch or it didn't explicitly specify the behavior of RackInferringSnitch (though arguably the existing example was more than sufficient). Fixes scylladb/scylladb#23528 Closes scylladb/scylladb#25700	2025-09-15 11:47:22 +02:00
Petr Gusev	92b165b8c0	pylib/rest_client.py: encode injection name Sometimes it's convenient to use slashes in injection names, for example my_component/my_method/my_condition. Without quote() we get 'handler not found' error from Scylla.	2025-09-15 11:24:53 +02:00
Petr Gusev	819d59eeba	storage_proxy_stats: add fenced_out_requests metric We have to drop const qualifiers because now check_fence needs to mutate this metric.	2025-09-15 11:24:53 +02:00
Petr Gusev	6d7af84fed	storage_proxy: add fencing to Paxos verbs This commit adds fencing support to all Paxos verbs: * Pass an optional (for backward compatibility) fencing_token as a parameter to the prepare, accept, learn, and prune verbs. * Call apply_fence twice — before and after accessing local data. This ensures that if the coordinator is fenced out mid-request, the replica does not return success, which would otherwise incorrectly contribute to achieving the target CL. Without this, a user might observe successful writes that become unreadable after the topology operation completes. * For prune, call apply_fence only once because it does not return a response to the LWT coordinator. Fixes scylladb/scylladb#22332	2025-09-15 11:24:53 +02:00
Petr Gusev	ab750af711	storage_proxy::apply_fence: add overload that throws on failure This new apply_fence overload checks the fence and reports a failure by throwing a regular exception.	2025-09-15 11:24:53 +02:00
Petr Gusev	a2bde28efe	storage_proxy: extract apply_fence_result This commit refactors a repeated pattern that applies the fence and embeds the exception into the exception_variant class by extracting it into a separate method.	2025-09-15 11:24:53 +02:00
Petr Gusev	bdfea2fa4c	sp::apply_fence: rename to apply_fence_on_ready This overload performs the fence check only when the future is ready. In this commit, we give it a more descriptive name to better reflect its behavior. Additionally, we add extensive comments explaining the overall fencing scheme and the motivation behind this specific overload.	2025-09-15 11:24:53 +02:00
Petr Gusev	4a5c856d44	sp::apply_fence: rename to check_fence We plan to introduce several additional apply_fence overloads in upcoming commits. To avoid ambiguity, this change renames the existing base function to check_fence.	2025-09-15 10:56:20 +02:00
Petr Gusev	7fb5b2006b	sp::apply_fence: make non-generic It's simpler and more consistent to always use locator::host_id for caller_address. We also slightly reformulate the comment for sp.apply_fence here.	2025-09-15 10:56:20 +02:00
Michał Jadwiszczak	dc1ffd2c10	service/storage_service: drain `view_building_worker` earlier Similarly to view builder, view building worker needs to be drained in `storage_service::do_drain()`. Storage service drain happens at the very beginning of the shutdown procedure. Before this patch, the worker was still building views after the storage service was drained and this caused errors like: `Error applying view update to (named_gate_closed_exception)` and `locator::no_such_tablet_map`. Fixes scylladb/scylladb#25908 Closes scylladb/scylladb#25984	2025-09-15 11:29:19 +03:00
Gleb Natapov	d3badf7406	storage_service: change node_ops_info::ignore_nodes to host id It drops the useless translation from id to ip during removenode through the topology coordinator. Closes scylladb/scylladb#25958	2025-09-15 10:18:24 +02:00
Sergey Zolotukhin	340413e797	gossiper: fix wrong gossiper instance used in `force_remove_endpoint` `gossiper::force_remove_endpoint` is always executed on shard 0 using `invoke_on`. Since each shard has its own `gossiper` instance, if `force_remove_endpoint` is called from a shard other than shard 0, `my_host_id()` may be invoked on the wrong `gossiper` object. This results in undefined behavior due to unsynchronized access to resources on another shard.	2025-09-15 08:54:59 +02:00
Aleksandra Martyniuk	55fde70f8d	api: tasks: task_manager: keep children identities in chunked_{array,vector} task_status contains a vector of children identities. If the number of children is large, we may hit oversized allocation. Change all types of children-related containers to chunked_vector. Modify the children type returned from task manager API. Fixes: scylladb#25795. Closes scylladb/scylladb#25923	2025-09-15 08:44:16 +03:00
Nadav Har'El	b4e3d4ac2f	alternator: nicer error message for integer overflow in list index In the DynamoDB API, when "a" is a list attribute, a[999] returns the 1000th element. But if the list isn't that long (e.g., it only has 5 elements), a[999] returns nothing - it's not an error. But it turns out that when the index is so large that it can't even be parsed as an integer, e.g., 99999999999999, DynamoDB does report an error: Invalid ProjectionExpression: List index is not within the allowable range; index: [99999999999999] Before this patch, Alternator also returned an error in this case, with the right type (ValidationException), but with a strange low-level error text: Failed parsing ProjectionExpression 'a[99999999999999]': std::out_of_range (stoi) The problem was that the code (in alternator/expressions.g) ran stoi() without converting its std::out_of_range exception to a better user-facing message. We do this in this patch, and the error message now looks like: Failed parsing ProjectionExpression 'a[99999999999999]': list index out of integer range This patch also includes a test reproducing this error, which passes on DynamoDB and on Alternator it fails before this patch and passes with the patch. Fixes #25947 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#25951	2025-09-15 08:43:00 +03:00
Nadav Har'El	208d3986a7	alternator: add explanation of internal tags Alternator needs to store a few pieces of information for each table that it can't store in the existing CQL schema. We decided to store this information in hidden tags - tags named with the prefix "system:" - and we already have four of those: Provisioned RCU and WCU, table creation time, and TTL's expiration-time attribute. This patch moves the definition of all four tags to one place in executor.cc, adds a short comment about the content of each tag, and adds a longer comment explaining why we have these hidden tags at all. It is expected that more hidden tags will follow - e.g., to solve issue #5320. So we expect more tags to be added later in the same place in the code. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#25980	2025-09-15 08:41:39 +03:00


49421 Commits