scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Nadav Har'El	12cbdfa095	test/cqlpy: add regression test for tombstone_gc in "desc table" The small cqlpy test in this patch is a regression test for issue #14390, which claimed that the Scylla-only "tombstone_gc" option is missing from the output of "describe table". This test shows that this report is not true, at least not when the "server-side describe" is used. "test/cqlpy/run --release ..." shows that this test passes on master and also for Scylla versions all the way back to Scylla 5.2 (Scylla 5.1 did not support server-side describe, so the test fails for that reason). This suggests that the report in issue #14390 was for old-style client-side (cqlsh) describe, which we no longer support, so this issue can be closed. Fixes #14390. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#22354	2025-01-20 16:43:21 +02:00
Avi Kivity	d2869ecb2b	partition_range_compat: drop dependency on boost ranges Unused anyway. Closes scylladb/scylladb#22359	2025-01-20 16:43:21 +02:00
Anna Stuchlik	e340d6a452	doc: remove Open Source references in the docs Fixes https://github.com/scylladb/scylladb/issues/22325 Closes scylladb/scylladb#22377	2025-01-20 16:43:21 +02:00
Botond Dénes	1f20f7810e	Merge 'main, encryption: correct misspellings' from Kefu Chai in this changeset, some misspellings identified by codespell were corrected. --- it's a cleanup, hence no need to backport. Closes scylladb/scylladb#22301 * github.com:scylladb/scylladb: ent/encryption: rename "sie" to "get_opt" ent,main: fix misspellings	2025-01-20 16:43:21 +02:00
Kefu Chai	1ef2d9d076	tree: migrate from boost::adaptors::transformed to std::views::transform Replace remaining uses of boost::adaptors::transformed with std::views::transform to reduce Boost dependencies, following the migration pattern established in `bab12e3a`. This change addresses recently merged code that reintroduced Boost header dependencies through boost::adaptors::transformed usage. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22365	2025-01-17 16:56:40 +02:00
Botond Dénes	47989b1503	Merge 'tasks: add tablet resize virtual task' from Aleksandra Martyniuk In this change, tablet_virtual_task starts supporting tablet resize (i.e. split and merge). Users can see running resize tasks - finished tasks are not presented with the task manager API. A new task state "suspended" is added. If a resize was revoked, it will appear to users as suspended. We assume that the resize was revoked when the tablet number didn't change. Fixes: #21366. Fixes: #21367. No backport, new feature Closes scylladb/scylladb#21891 * github.com:scylladb/scylladb: test: boost: check resize_task_info in tablet_test.cc test: add tests to check revoked resize virtual tasks test: add tests to check the list of resize virtual tasks test: add tests to check spilt and merge virtual tasks status test: test_tablet_tasks: generalize functions replica: service: add split virtual task's children replica: service: pass parent info down to storage_group::split tasks: children of virtual tasks aren't internal by default tasks: initialize shard in task_info ctor service: extend tablet_virtual_task::abort service: retrun status_helper struct from tablet_virtual_task::get_status_helper service: extend tablet_virtual_task::wait tasks: add suspended task state service: extend tablet_virtual_task::get_status service: extend tablet_virtual_task::contains service: extend tablet_virtual_task::get_stats service: add service::task_manager_module::get_nodes tasks: add task_manager::get_nodes tasks: drop noexcept from module::get_nodes replica: service: add resize_task_info static column to system.tablets locator: extend tablet_task_info to cover resize tasks	2025-01-17 14:24:07 +02:00
Piotr Dulikowski	6aa962f5f4	Merge 'Add audit subsystem for database operations' from Paweł Zakrzewski Introduces a comprehensive audit system to track database operations for security and compliance purposes. This change includes: Core Components: - New audit subsystem for logging database operations - Service level integration for proper resource management - CQL statement tracking with operation categories - Login process integration for tenant management Key Features: - Configurable audit logging (syslog/table) - Operation categorization (QUERY/DML/DDL/DCL/AUTH/ADMIN) - Selective auditing by keyspace/table - Password sanitization in audit logs - Service level shares support (1-1000) for workload prioritization - Proper lifecycle management and cleanup I ran the dtests for audit (manually enabled) and they pass. The in-repo tests pass. Notably, there should be no non-whitespace changes between this and scylla-enterprise Fixes scylladb/scylla-enterprise#4999 Closes scylladb/scylladb#22147 * github.com:scylladb/scylladb: audit: Add shares support to service level management audit: Add service level support to CQL login process audit: Add support to CQL statements audit: Integrate audit subsystem into Scylla main process audit: Add documentation for the audit subsystem audit: Add the audit subsystem	2025-01-17 13:14:55 +01:00
Kamil Braun	89ee2a6834	Merge 'drop ip addresses from token metadata' from Gleb Now that all topology related code uses host ids there is no point in maintaining ip to id (and back) mappings in the token metadata. After the patch the mapping will be maintained in the gossiper only. The rest of the system will use host ids and in rare cases where translation is needed (mostly for UX compatibility reasons) the translation will be done using gossiper. Fixes: scylladb/scylla#21777 * 'gleb/drop-ip-from-tm-v3' of github.com:scylladb/scylla-dev: (57 commits) hint manager: do not translate ip to id in case hint manager is stopped already locator: token_metadata: drop update_host_id() function that does nothing now locator: topology: drop indexing by ips repair: drop unneeded code storage_service: use host_id to look for a node in on_alive handler storage_proxy: translate ips to ids in forward array using gossiper locator: topology: remove unused functions storage_service: check for outdated ip in on_change notification in the peers table storage_proxy: translate id to ip using address map in tablets's describe_ring code instead of taking one from the topology topology coordinator: change connection dropping code to work on host ids cql3: report host id instead of ip in error during SELECT FROM MUTATION_FRAGMENTS query locator: drop unused function from tablet_effective_replication_map api: view_build_statuses: do not use IP from the topology, but translate id to ip using address map instead locator: token_metadata: remove unused ip based functions locator: network_topology_strategy: use host_id based function to check number of endpoints in dcs gossiper: drop get_unreachable_token_owners functions storage_service: use gossiper to map ip to id in node_ops operations storage_service: fix indentation after the last patch storage_service: drop loops from node ops replace_prepare handling since there can be only one replacing node token_metadata: drop no longer used functions ...	2025-01-17 11:00:52 +01:00
Kefu Chai	4a5a00347f	utils: do not include unused headers these unused includes were identified by clang-include-cleaner. after auditing these source files, all of the reports have been confirmed. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22201	2025-01-17 11:24:54 +03:00
Botond Dénes	55963f8f79	replica: remove noexcept from token -> tablet resolution path The methods to resolve a key/token/range to a table are all noexcept. Yet the method below all of these, `storage_group_for_id()` can throw. This means that if due to any mistake a tablet without local replica is attempted to be looked up, it will result in a crash, as the exception bubbles up into the noexcept methods. There is no value in pretending that looking up the tablet replica is noexcept, remove the noexcept specifiers so that any bad lookup only fails the operation at hand and doesn't crash the node. This is especially relevant to replace, which still has a window where writes can arrive for tablets that don't (yet) have a local replica. Currently, this results in a crash. After this patch, this will only fail the writes and the replace can move on. Fixes: #21480 Closes scylladb/scylladb#22251	2025-01-17 11:24:09 +03:00
Łukasz Paszkowski	adef719c43	api/storage_service: Remove unimplemented truncate API The API /storage_service/truncate/{ks} returns an unimplemented error when invoked. As we already have a CQL command, `TRUNCATE TABLE ks.cf` that causes the table to be truncated on all nodes, the API can be dropped. Due to the error, it is unused. Fixes https://github.com/scylladb/scylladb/issues/10520 No backport is required. A small cleanup of not working API. Closes scylladb/scylladb#22258	2025-01-17 11:21:05 +03:00
Pavel Emelyanov	14c3fbbf8c	Merge 'sstable_directory: do not load remote unshared sstables in process_descriptor()' from Lakshmi Narayanan Sreethar The sstable loader relied on the generation id to provide an efficient hint about the shard that owns an sstable. But, this hint was rendered ineffective with the introduction of UUID generation, as the shard id was no longer embedded in the generation id. This also became suboptimal with the introduction of tablets. Commit `0c77f77` addressed this issue by reading the minimum from disk to determine sstable ownership but this improvement was lost with commit `63f1969`, which optimistically assumed that hints would work most of the time, which isn't true. This commit restores that change - shard id of a table is deduced by reading minimally from disk and then the sstable is fully loaded only if it belongs to the local shard. This patch also adds a testcase to verify that the sstables are loaded only in their respective shards. Fixes #21015 This fixes a regression and should be backported. Closes scylladb/scylladb#22263 * github.com:scylladb/scylladb: sstable_directory: do not load remote sstables in process_descriptor sstable_directory: update `load_sstable()` definition sstable_directory: reintroduce `get_shards_for_this_sstable()`	2025-01-17 11:17:54 +03:00
Avi Kivity	d6f7f873d0	utils: config_file: don't use extern fully specialized variable templates Declaring-but-not-defining a fully specialized template is a great way to cut dependencies between users and providers, but unfortunately not supported for variable templates. Clang 18 does support it, but apparently it is a misinterpretation of the standard, and was removed in clang 19. We started using this non-feature in `7ed89266b3`. The fix is to use function templates. This is more verbose as each specialization needs to define a static variable to return, but is fully supported. Closes scylladb/scylladb#22299	2025-01-17 11:06:50 +03:00
Botond Dénes	2428f22d3e	Update tools/python3 submodule * tools/python3 fbf12d02...8415caf4 (1): > dist: Support FIPS mode	2025-01-17 09:17:29 +02:00
Tzach Livyatan	a00ab65491	remove BETA from metric and API reference Closes scylladb/scylladb#22092	2025-01-16 19:25:51 -05:00
Łukasz Paszkowski	aad46bd6f3	reader_concurrency_semaphore: do_wait_admission(): remove dumping diagnostics The commit `b39ca29b3c` introduced detection of admission-waiter anomaly and dumps permit diagnostics as soon as the semaphore did not admit readers even though it could. Later on, the commit `bf3d0b3543` introduces the optimization where the admission check is moved to the fiber processing the _read_list. Since the semaphore no longer admits readers as soon as it can, dumping diagnostic errors is not necessary as the situation is not abnormal. Closes scylladb/scylladb#22344	2025-01-16 19:23:43 -05:00
Nadav Har'El	955ac1b7b7	test/alternator: close boto3 client before shutting down For several years now, we have seen a strange, and very rare, flakiness in Alternator tests described in issue #17564: We see all the tests pass, pytest declares them to have passed, and while Python is exiting, it crashes with a signal 11 (SIGSEGV). Because this happens exclusively in test/alternator and never in the test/cqlpy, we suspect that something that the test/alternator leaves behind but test/cqlpy does not, causes some race and crashes during shutdown. The immediate suspect is the boto3 library, or rather, the urllib3 library which it uses. This is more-or-less the only thing that test/alternator does which test/cqlpy doesn't. The urllib3 library keeps around pools of reusable connections, and it's possible (although I don't actually have any proof for it) that these open connections may cause a crash during shutdown. So in this patch I add to the "dynamodb" and "dynamodbstreams" fixtures (which all Alternator tests use to connect to the server), a teardown which calls close() for the boto3 client object. This close() call percolates down to calling clear() on urllib3's PoolManager. Hopefully, this will make some difference in the chance to crash during shutdown - and if it doesn't, it won't hurt. Refs #17564 Closes scylladb/scylladb#22341	2025-01-16 19:21:00 -05:00
Gleb Natapov	a40e810442	hint manager: do not translate ip to id in case hint manager is stopped already Since we do not stop storage proxy on shutdown this code can be called during shutdown when address map is no longer usable.	2025-01-16 16:37:08 +02:00
Gleb Natapov	1e4b2f25dc	locator: token_metadata: drop update_host_id() function that does nothing now	2025-01-16 16:37:08 +02:00
Gleb Natapov	50fb22c8f9	locator: topology: drop indexing by ips Do not track id to ip mapping in the topology class any longer. There are no remaining users.	2025-01-16 16:37:08 +02:00
Gleb Natapov	f9df092fd1	repair: drop unneeded code There is code that creates a map from id to ip and then creates a vector from the keys of the map. Create a vector directly instead.	2025-01-16 16:37:08 +02:00
Gleb Natapov	12da203cae	storage_service: use host_id to look for a node in on_alive handler	2025-01-16 16:37:08 +02:00
Gleb Natapov	d45ce6fa12	storage_proxy: translate ips to ids in forward array using gossiper We already use it to translate reply_to, so do it for consistency and to drop ip based API usage.	2025-01-16 16:37:08 +02:00
Gleb Natapov	db73758655	locator: topology: remove unused functions	2025-01-16 16:37:07 +02:00
Gleb Natapov	fb28ff5176	storage_service: check for outdated ip in on_change notification in the peers table The code checks that it does not run for an ip address that is no longer in use (after ip address change). To check that we can use peers table and see if the host id is mapped to the address. If yes, this is the latest address for this host id otherwise this is an outdated entry.	2025-01-16 16:37:07 +02:00
Gleb Natapov	163099678e	storage_proxy: translate id to ip using address map in tablets's describe_ring code instead of taking one from the topology We want to drop ip from the locator::node.	2025-01-16 16:37:07 +02:00
Gleb Natapov	49fa1130ef	topology coordinator: change connection dropping code to work on host ids Do not use ip from topology::node, but look it up in address map instead. We want to drop ip from the topology::node.	2025-01-16 16:37:07 +02:00
Gleb Natapov	83d15b8e32	cql3: report host id instead of ip in error during SELECT FROM MUTATION_FRAGMENTS query We want to drop ip from the topology::node.	2025-01-16 16:37:07 +02:00
Gleb Natapov	5cd3627baa	locator: drop unused function from tablet_effective_replication_map	2025-01-16 16:37:07 +02:00
Gleb Natapov	122d58b4ad	api: view_build_statuses: do not use IP from the topology, but translate id to ip using address map instead	2025-01-16 16:37:07 +02:00
Gleb Natapov	97f95f1dbd	locator: token_metadata: remove unused ip based functions	2025-01-16 16:37:07 +02:00
Gleb Natapov	3068e38baa	locator: network_topology_strategy: use host_id based function to check number of endpoints in dcs	2025-01-16 16:37:07 +02:00
Gleb Natapov	0ec9f7de64	gossiper: drop get_unreachable_token_owners functions It is used by truncate code only and even there it only checks whether the returned set is not empty. Check for dead token owners in the truncation code directly.	2025-01-16 16:37:07 +02:00
Gleb Natapov	a7a7cdcf42	storage_service: use gossiper to map ip to id in node_ops operations Replace operation is special though. In case of replacing with the same IP the gossiper will not have the mapping, and node_ops RPC unfortunately does not send host id of a replaced node. For replace we consult peers table instead to find the old owner of the IP. A node that is replacing (the coordinator of the replace) will not have it though, but luckily it is not needed since it updates metadata during join_topology() anyway. The only thing that is missing there is add_replacing_endpoint() call which the patch adds.	2025-01-16 16:37:07 +02:00
Gleb Natapov	0db6136fa5	storage_service: fix indentation after the last patch	2025-01-16 16:37:07 +02:00
Gleb Natapov	9197b88e48	storage_service: drop loops from node ops replace_prepare handling since there can be only one replacing node The call already throws an error if there is more than one. Throw if there are zero as well and drop the loops.	2025-01-16 16:37:07 +02:00
Gleb Natapov	fcfd005023	token_metadata: drop no longer used functions	2025-01-16 16:37:07 +02:00
Gleb Natapov	7c4c485651	host_id_or_endpoint: use gossiper to resolve ip to id and back mappings host_id_or_endpoint is a helper class that holds either id or ip and translates one into another on demand. Use gossiper to do a translation there instead of token_metadata since we want to drop ip based APIs from the latter.	2025-01-16 16:37:07 +02:00
Gleb Natapov	70cc014307	storage_service: ip_address_updater: check peers table instead of token_metadata whether ip was changed As part of changing IP address peers table is updated. If it has a new address the update can be skipped.	2025-01-16 16:37:07 +02:00
Gleb Natapov	8e55cc6c78	storage_service: fix logging When logger outputs a range it already does join, so no other join is needed.	2025-01-16 16:37:07 +02:00
Gleb Natapov	7556e3d045	topology coordinator: remove gossiper entry only if host id matches provided one Currently the entry is removed only if ip is not used by any normal or transitioning node. This is done to not remove a wrong entry that just happen to use the same ip, but the same can be achieved by checking host id in the entry.	2025-01-16 16:37:07 +02:00
Gleb Natapov	593308a051	node_ops, cdc: drop remaining token_metadata::get_endpoint_for_host_id() usage Use address map to translate id to ip instead. We want to drop ips from token_metadata.	2025-01-16 16:37:07 +02:00
Gleb Natapov	ae8dc595e1	hints: move id to ip translation into store_hint() function Also use gossiper to translate instead of token_metadata since we want to get rid of ip based APIs there.	2025-01-16 16:37:06 +02:00
Gleb Natapov	c7d08fe1fe	storage_service: change get_dc_rack_for() to work on host ids	2025-01-16 16:37:06 +02:00
Gleb Natapov	415e8de36e	locator: topology: change get_datacenter_endpoints and get_datacenter_racks to return host ids and amend users	2025-01-16 16:37:06 +02:00
Gleb Natapov	8a0fea5fef	locator: topology: drop is_me ip overload along with remaning users	2025-01-16 16:37:06 +02:00
Gleb Natapov	2ea8df2cf5	storage_proxy: drop is_alive that works on ip since it is not used any more	2025-01-16 16:37:06 +02:00
Gleb Natapov	8433947932	locator: topology: remove get_location overload that works on ip and its last users	2025-01-16 16:37:06 +02:00
Gleb Natapov	25eb98ecbc	locator: topology: drop no longer used ip based overloads	2025-01-16 16:37:06 +02:00
Gleb Natapov	315db647dd	consistency_level: drop templates since the same types of ranges are used by all the callers	2025-01-16 16:37:06 +02:00


46278 Commits