scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 18:50:53 +00:00

Author	SHA1	Message	Date
Michał Hudobski	40d180a7ef	docs: update vector search filtering to reflect primary key support only Remove outdated references to filtering on columns provided in the index definition, and remove the note about equal relations (= and IN) being the only supported operations. Vector search filtering currently supports WHERE clauses on primary key columns only. Closes scylladb/scylladb#28949	2026-03-16 17:16:16 +02:00
Botond Dénes	fc8cebd671	Merge 'Verify components digests during component load and scrub in validate mode' from Taras Veretilnyk This PR adds integrity verification for SSTable component files during loading. When component digests are present in Scylla metadata, the loader now validates each component's CRC32 digest against the stored expected value, catching silent corruption of component files. Index, Rows and Partitions components digests are also validated duriung scrub in validate mode Added corruption tests that write an SSTable, flip a bit in a specific component file, then verify that reloading the SSTable detects the corruption and throws the expected exception. Depends on https://github.com/scylladb/scylladb/pull/28338 Backport is not required, this is new feature Fixes https://github.com/scylladb/scylladb/issues/20103 Closes scylladb/scylladb#28761 * github.com:scylladb/scylladb: test/cqlpy: test --ignore-component-digest-mismatch flag in scylla sstable upgrade docs: document --ignore-component-digest-mismatch flag for scylla sstable upgrade sstables: propagate ignore_component_digest_mismatch config to all load sites sstables: add option to ignore component digest mismatches sstable_compaction_test: Add scrub validate test for corrupted index sstables: add tests for component digest validation on corrupted SSTables sstables: validate index components digests during SSTable scrub in validate mode sstables: verify component digests on SSTable load sstables: add digest_file_random_access_reader for CRC32 digest computation	2026-03-13 09:55:55 +02:00
David	79f9967eaa	docs: update theme 1.9 Motivation Upgrades Sphinx to 9.x, MyST Parser to 5.x, Python to 3.11+–3.14, Node.js to 22, and replaces Poetry with uv for dependency management. Changelog: https://github.com/scylladb/sphinx-scylladb-theme/blob/master/docs/source/upgrade/CHANGELOG.md#190---26-february-2026 How to test * Make sure you are using Python 3.11-3.14: * python --version * Install uv: * make setupenv * Build the docs: * make preview * Docs should render without errors at http://127.0.0.1:5500 Closes scylladb/scylladb#28971	2026-03-11 16:56:51 +02:00
Dmitriy Kruglov	cee44716db	docs: add cluster platform migration procedure Document how to migrate a ScyllaDB cluster to different instance types using the add-and-replace node cycling approach. Closes: QAINFRA-42 Closes scylladb/scylladb#28458	2026-03-11 09:31:35 +02:00
Taras Veretilnyk	fc4c82b962	docs: document --ignore-component-digest-mismatch flag for scylla sstable upgrade	2026-03-10 19:24:05 +01:00
Botond Dénes	81e214237f	Merge 'Add digests for all sstable components in scylla metadata' from Taras Veretilnyk This pull request adds support for calculation and storing CRC32 digests for all SSTable components. This change replaces plain file_writer with crc32_digest_file_writer for all SSTable components that should be checksummed. The resulting component digests are stored in the sstable structure and later persisted to disk as part of the Scylla metadata component during writer::consume_end_of_stream. Several test cases where introduced to verify expected behaviour. Additionally, this PR adds new rewrite component mechanism for safe sstable component rewriting. Previously, rewriting an sstable component (e.g., via rewrite_statistics) created a temporary file that was renamed to the final name after sealing. This allowed crash recovery by simply removing the temporary file on startup. However, with component digests stored in scylla_metadata (#20100), replacing a component like Statistics requires atomically updating both the component and scylla_metadata with the new digest - impossible with POSIX rename. The new mechanism creates a clone sstable with a fresh generation: - Hard-links all components from the source except the component being rewritten and scylla_metadata - Copies original sstable components pointer and recognized components from the source - Invokes a modifier callback to adjust the new sstable before rewriting - Writes the modified component along with updated scylla_metadata containing the new digest - Seals the new sstable with a temporary TOC - Replaces the old sstable atomically, the same way as it is done in compaction This is built on the rewrite_sstables compaction framework to support batch operations (e.g., following incremental repair). In case of any failure durning the whole process, sstable will be automatically deleted on the node startup due to temporary toc persistence. Backport is not required, it is a new feature Fixes https://github.com/scylladb/scylladb/issues/20100, https://github.com/scylladb/scylladb/issues/27453 Closes scylladb/scylladb#28338 * github.com:scylladb/scylladb: docs: document components_digests subcomponent and trailing digest in Scylla.db sstable_compaction_test: Add tests for perform_component_rewrite sstable_test: add verification testcases of SSTable components digests persistance sstables: store digest of all sstable components in scylla metadata sstables: replace rewrite_statistics with new rewrite component mechanism sstables: add new rewrite component mechanism for safe sstable component rewriting compaction: add compaction_group_view method to specify sstable version sstables: add null_data_sink and serialized_checksum for checksum-only calculation sstables: extract default write open flags into a constant sstables: Add write_simple_with_digest for component checksumming sstables: Extract file writer closing logic into separate methods sstables: Implement CRC32 digest-only writer	2026-03-10 16:02:53 +02:00
Marcin Maliszkiewicz	b6a7484520	docs: note eventual visibility of auth changes Mention that role and permission changes are durable but may not be immediately visible on other nodes due to asynchronous replication. Fixes: SCYLLADB-651 Closes scylladb/scylladb#28900	2026-03-09 14:07:10 +01:00
Taras Veretilnyk	739dd59ebc	docs: document components_digests subcomponent and trailing digest in Scylla.db Document the new `components_digests` subcomponent (tag 12) added to the Scylla.db metadata component, which stores CRC32 digests of all checksummed SSTable component files. Also document the trailing CRC32 digest that stores digest of the scylla metadata itself.	2026-03-06 21:58:15 +01:00
Anna Stuchlik	855c503c63	doc: fix the unified installer instructions This commit updates the documentation for the unified installer. - The Open Source example is replaced with version 2025.1 (Source Available, currently supported, LTS). - The info about CentOS 7 is removed (no longer supported). - Java 8 is removed. - The example for cassandra-stress is removed (as it was already removed on other installation pages). Fixes https://github.com/scylladb/scylladb/issues/28150 Closes scylladb/scylladb#28152	2026-03-05 12:57:06 +02:00
Dawid Mędrek	7564a56dc8	Merge 'tombstone_gc: allow using repair-mode tombstone gc with RF=1 tables' from Botond Dénes Currently, repair-mode tombstone-gc cannot be used on tables with RF=1. We want to make repair-mode the default for all tablet tables (and more, see https://github.com/scylladb/scylladb/issues/22814), but currently a keyspace created with RF=1 and later altered to RF>1 will end up using timeout-mode tombstone gc. This is because the repair-mode tombstone-gc code relies on repair history to determine the gc-before time for keys/ranges. RF=1 tables cannot run repairs so they will have empty repair history and consequently won't be able to purge tombstones. This PR solves this by keeping a registry of RF=1 tables and consulting this registry when creating `tombstone_gc_state` objects. If the table is RF=1, tombstone-gc will work as if the table used immediate-mode tombstone-gc. The registry is updated on each replication update. As soon as the table is not RF=1 anymore, the tombstone-gc reverts to the natural repair-mode behaviour. After this PR, tombstone-gc defaults to repair-mode for all tables, regardless of RF and tablets/vnodes. Fixes: SCYLLADB-106. New feature, no backport required. Closes scylladb/scylladb#22945 * github.com:scylladb/scylladb: test/{boost,cluster}: add test for tombstone gc mode=repair with RF=1 tombstone_gc: allow use of repair-mode for RF=1 tables replica/table: update rf=1 table registry in shared tombstone-gc state tombstone_gc: tombstone_gc_before_getter: consider RF when getting gc before time tombstone_gc: unpack per_table_history_maps tombstone_gc: extract _group0_gc_time from per_table_history_map tombstone_gc: drop tombstone_gc_state(nullptr) ctor and operator bool() test/lib/random_schema: use timeout-mode tombstone_gc tombstone_gc_options: add C++ friendly constructor test: move away from tombstone_gc_state(nullptr) ctor treewide: move away from tombstone_gc_state(nullptr) ctor sstable: move away from tombstone_gc_mode::operator bool() replica/table: add get_tombstone_gc_state() compaction: use tombstone_gc_state with value semantics db/row_cache: use tombstone_gc_state with value semantics tombstone_gc: introduce tombstone_gc_state::for_tests()	2026-03-05 11:50:31 +01:00
Marcin Maliszkiewicz	c7d3f80863	Merge 'auth: do not create default 'cassandra:cassandra' superuser' from Dario Mirovic This patch series removes creation of default 'cassandra:cassandra' superuser on system start. Disable creation of a superuser with default 'cassandra:cassandra' credentials to improve security. The current flow requires clients to create another superuser and then drop the default `cassandra:cassandra' role. For those who do, there is a time window where the default credentials exist. For those who do not, that role stays. We want to improve security by forcing the client to either use config to specify default values for default superuser name and password or use cqlsh over maintenance socket connection to explicitly create/alter a superuser role. The patch series: - Enable role modification over the maintenance socket - Stop using default 'cassandra' value for default superuser, skipping creation instead Design document: https://scylladb.atlassian.net/wiki/spaces/RND/pages/165773327/Drop+default+cassandra+superuser Fixes scylladb/scylla-enterprise#5657 This is an improvement. It does not need a backport. Closes scylladb/scylladb#27215 * github.com:scylladb/scylladb: config: enable maintenance socket in workdir by default docs: auth: do not specify password with -p option docs: update documentation related to default superuser test: maintenance socket role management test: cluster: add logs to test_maintenance_socket.py test: pylib: fix connect_driver handling when adding and starting server auth: do not create default 'cassandra:cassandra' superuser auth: remove redundant DEFAULT_USER_NAME from password authenticator auth: enable role management operations via maintenance socket client_state: add has_superuser method client_state: add _bypass_auth_checks flag auth: let maintenance_socket_role_manager know if node is in maintenance mode auth: remove class registrator usage auth: instantiate auth service with factory functors auth: add service constructor with factory functors auth: add transitional.hh file service: qos: handle special scheduling group case for maintenance socket service: qos: use _auth_integration as condition for using _auth_integration	2026-03-04 09:43:57 +01:00
Botond Dénes	5998a859f7	tombstone_gc: allow use of repair-mode for RF=1 tables Modify the methods which calculate the default gc mode as well as that which validates whether repair-mode can be used at all, so both accepts use of repair-mode on RF=1 tables. This de-facto changes the default tombstone-gc to repair-mode for all tables. Documentation is updated accordingly. Some tests need adjusting: * cqlpy/test_select_from_mutation_fragments.py: disable GC for some test cases because this patch makes tombstones they write subject to GC when using defaults. * test/cluster/test_mv.py::test_mv_tombstone_gc_not_inherited used repair-mode as a non-default for the base table and expected the MV to revert to default. Another mode has to be used as the non-default (immediate). * test/cqlpy/test_tools.py::test_scylla_sstable_dump_schema: don't compare tombstone_gc schema extension when comparing dumped schema vs. original. The tool's schema loader doesn't have access to the keyspace definition so it will come up with different defaults for tombstone-gc. * test/boost/row_cache_test.cc::test_populating_cache_with_expired_and_nonexpired_tombstones sets tombstone expiry assuming the tombstone-gc timeout-mode default. Change the CREATE TABLE statement to set the expected mode.	2026-03-04 09:44:24 +02:00
Avi Kivity	85bd6d0114	Merge 'Add multiple-shard persistent metadata storage for strongly consistent tables' from Wojciech Mitros In this series we introduce new system tables and use them for storing the raft metadata for strongly consistent tables. In contrast to the previously used raft group0 tables, the new tables can store data on any shard. The tables also allow specifying the shard where each partition should reside, which enables the tablets of strongly consistent tables to have their raft group metadata co-located on the same shard as the tablet replica. The new tables have almost the same schemas as the raft group0 tables. However, they have an additional column in their partition keys. The additional column is the shard that specifies where the data should be located. While a tablet and its corresponding raft group server resides on some shard, it now writes and reads all requests to the metadata tables using its shard in addition to the group_id. The extra partition key column is used by the new partitioner and sharder which allow this special shard routing. The partitioner encodes the shard in the token and the sharder decodes the shard from the token. This approach for routing avoids any additional lookups (for the tablet mapping) during operations on the new tables and it also doesn't require keeping any state. It also doesn't interact negatively with resharding - as long as tablets (and their corresponding raft metadata) occupy some shard, we do not allow starting the node with a shard count lower than the id of this shard. When increasing the shard count, the routing does not change, similarly to how tablet allocation doesn't change. To use the new tables, a new implementation of `raft::persistence` is added. Currently, it's almost an exact copy of the `raft_sys_table_storage` which just uses the new tables, but in the future we can modify it with changes specific to metadata (or mutation) storage for strongly consistent tables. The new storage is used in the `groups_manager`, which combined with the removal of some `this_shard_id() == 0` checks, allows strongly consistent tables to be used on all shards. This approach for making sure that the reads/writes to the new tables end up on the correct shards won in the balance of complexity/usability/performance against a few other approaches we've considered. They include: 1. Making the Raft server read/write directly to the database, skipping the sharder, on its shard, while using the default partitioner/sharder. This approach could let us avoid changing the schema and there should be no problems for reads and writes performed by the Raft server. However, in this approach we would input data in tables conflicting with the placement determined by the sharder. As a result, any read going through the sharder could miss the rows it was supposed to read. Even when reading all shards to find a specific value, there is a risk of polluting the cache - the rows loaded on incorrect shards may persist in the cache for an unknown amount of time. The cache may also mistakenly remember that a row is missing, even though it's actually present, just on an incorrect shard. Some of the issues with this approach could be worked around using another sharder which always returns this_shard_id() when asked about a shard. It's not clear how such a sharder would implement a method like `token_for_next_shard`, and how much simpler it would be compared to the current "identity" sharder. 2. Using a sharder depending on the current allocation of tablets on the node. This approach relies on the knowledge of group_id -> shard mapping at any point in time in the cluster. For this approach we'd also need to either add a custom partitioner which encodes the group_id in the token, or we'd need to track the token(group_id) -> shard mapping. This approach has the benefit over the one used in the series of keeping the partition key as just group_id. However, it requires more logic, and the access to the live state of the node in the sharder, and it's not static - the same token may be sharded differently depending on the state of the node - it shouldn't occur in practice, but if we changed the state of the node before adjusting the table data, we would be unable to access/fix the stale data without artificially also changing the state of the node. 3. Using metadata tables co-located to the strongly consistent tables. This approach could simplify the metadata migrations in the future, however it would require additional schema management of all co-located metadata tables, and it's not even obvious what could be used as the partition key in these tables - some metadata is per-raft-group, so we couldn't reuse the partition key of the strongly consistent table for it. And finding and remembering a partition key that is routed to a specific shard is not a simple task. Finally, splits and merges will most likely need special handling for metadata anyway, so we wouldn't even make use of co-located table's splits and merges. Fixes [SCYLLADB-361](https://scylladb.atlassian.net/browse/SCYLLADB-361) [SCYLLADB-361]: https://scylladb.atlassian.net/browse/SCYLLADB-361?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ Closes scylladb/scylladb#28509 * github.com:scylladb/scylladb: docs: add strong consistency doc test/cluster: add tests for strongly-consistent tables' metadata persistence raft: enable multi-shard raft groups for strongly consistent tablets test/raft: add unit tests for raft_groups_storage raft: add raft_groups_storage persistence class db: add system tables for strongly consistent tables' raft groups dht: add fixed_shard_partitioner and fixed_shard_sharder raft: add group_id -> shard mapping to raft_group_registry schema: add with_sharder overload accepting static_sharder reference	2026-03-04 08:55:43 +02:00
Geoff Montee	0eb5603ebd	Docs: describe the system tables Fixes issue #12818 with the following docs changes: docs/dev/system_keyspace.md: Added missing system tables, added table of contents (TOC), added categories Closes scylladb/scylladb#27789	2026-03-04 08:55:43 +02:00
Dario Mirovic	6e83fb5029	docs: auth: do not specify password with -p option Specifying password with -p option is considered unsafe. The password will be saved in bash history. The preferred approach is to enter the password when prompted. Any approach that passes the password via command line arguments makes that password visible in process options (ps command), no matter if the password is passed directly or as an environment variable. Refs SCYLLADB-409	2026-03-04 00:01:07 +01:00
Dario Mirovic	afafb8a8fa	docs: update documentation related to default superuser Update create superuser procedure: - Remove notes about default `cassandra` superuser - Add create superuser using existing superuser section - Update create superuser by using `scylla.yaml` config - Add create superuser using maintenance socket Update password reset procedure: - Add maintenance socket approach - Remove the old approach with deleting all the roles Update enabling authentication with downtime and during runtime: - Mention creating new superuser over the maintenance socket - Remove default superuser usage Update enable authorization: - Mention creating new superuser over the maintenance socket - Remove mention of default superuser Reasoning for deletion of the old approach: - [old] Needs cluster downtime, removes all roles, needs recreation of roles, needs maintenance socket anyways, if config values are not used for superuser - [new] No cluster downtime, possibly one node restart to enable maintenance socket, faster Refs SCYLLADB-409	2026-03-04 00:01:07 +01:00
Dimitrios Symonidis	80b74d7df2	tablet options: Add max_tablet_count tablet option to enforce tablet count upper bounds Introduced a new max_tablet_count tablet option that caps the maximum number of tablets a table can have. This feature is designed primarily for backup and restore workflows. During backup, when load balancing is disabled for snapshot consistency, the current tablet count is recorded in the backup manifest. During restore, max_tablet_count is set to this recorded value, ensuring the restored table's tablet count never exceeds the original snapshot's tablet distribution. This guarantee enables efficient file-based SSTable streaming during restore, as each SSTable remains fully contained within a single tablet boundary. Closes scylladb/scylladb#28450	2026-03-03 11:19:24 +03:00
Sergey Zolotukhin	33923578eb	Update DROP INDEX statement documentation Clarify behavior of DROP INDEX during ongoing builds. Closes scylladb/scylladb#28659	2026-03-02 17:31:23 +02:00
Botond Dénes	ab532882db	tools/scylla-sstable: introduce scylla sstable split Split input sstable(s) into multiple output sstables based on the provided token boundaries. The input sstable(s) are divided according to the specified split tokens, creating one output sstable per token range. Fixes: SCYLLADB-10 Closes scylladb/scylladb#28741	2026-03-02 15:19:17 +01:00
dependabot[bot]	f5fa77ac9d	build(deps): bump sphinx-multiversion-scylla in /docs Bumps [sphinx-multiversion-scylla](https://holzhaus.github.io/sphinx-multiversion/) from 0.3.4 to 0.3.7. --- updated-dependencies: - dependency-name: sphinx-multiversion-scylla dependency-version: 0.3.7 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Closes scylladb/scylladb#28833	2026-03-02 14:13:03 +01:00
Anna Stuchlik	dfd46ad3fb	doc: add the upgrade guide from 2025.x to 2026.1 This commit adds the upgrade guide for version 2026.1. According to the new upgrade policy, the user can now upgrade to the major version (2026.1) from any previous minor version. So instead of adding a separate guide form 2025.4 to 2026.1, we need a guide from 2025.x to 2026.1. In addition, this commit: - Updates the upgrade policy for reflect the above change. - Removes the upgrade guides for the previous version. Fixes https://github.com/scylladb/scylladb/issues/28533 Fixes https://github.com/scylladb/scylladb/issues/28532 Closes scylladb/scylladb#28789	2026-02-27 15:36:34 +02:00
Nadav Har'El	34c0c64d9d	docs/cql: document the new CQL per-row TTL feature Add user-facing documentation for the new CQL per-row TTL feature, in docs/cql/cql-extensions.md. Also mention (and link) the new alternative TTL feature in a few relevant documents about the old (per-write) TTL, about CDC, and about the CREATE TABLE and ALTER TABLE commands. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-25 14:59:44 +02:00
Wojciech Mitros	d1ff8f1db3	docs: add strong consistency doc Add a new docs/dev document for the strongly consistent tables feature. For now, it only contains information about the Raft metadata persistence, but it should be updated as more of the strong-consistency components are added.	2026-02-25 12:34:58 +01:00
Anna Stuchlik	64b1798513	doc: remove reduntant Java-related information This commit removes: - Instructions to install scylla-jmx (and all references) - The Java 11 requirement for Ubuntu. Fixes https://github.com/scylladb/scylladb/issues/28249 Fixes https://github.com/scylladb/scylladb/issues/28252 Closes scylladb/scylladb#28254	2026-02-24 14:37:39 +01:00
Anna Stuchlik	e2333a57ad	doc: remove the tablets limitation for Alternator This commit removes the information that Alternator doesn't support tablets. The limitation is no longer valid. Fixes SCYLLADB-778 Closes scylladb/scylladb#28781	2026-02-24 14:24:30 +02:00
Aleksandra Martyniuk	9ccc95808f	docs: update nodetool rebuild docs Update nodetool rebuild docs to mention that the command does not work for tablet keyspaces. Fixes: https://github.com/scylladb/scylladb/issues/28270.	2026-02-23 12:45:01 +01:00
Aleksandra Martyniuk	e4c42acd8f	docs: update a procedure of decommissioning a DC Update a procedure of decommissioning a DC for tablet keyspaces. Fixes: https://github.com/scylladb/scylladb/issues/28307.	2026-02-23 12:45:01 +01:00
Aleksandra Martyniuk	1c764cf6ea	docs: update a procedure of adding a DC Update a procedure of adding a DC for tablet keyspaces. Fixes: https://github.com/scylladb/scylladb/issues/28306.	2026-02-23 12:45:01 +01:00
Aleksandra Martyniuk	e08ac60161	docs: describe upgrade to enforce_rack_list option	2026-02-23 12:44:57 +01:00
Aleksandra Martyniuk	eefe66b2b2	docs: describe conversion to rack-list RF Fixes: SCYLLADB-398	2026-02-23 12:41:33 +01:00
Botond Dénes	7e90ed657c	Merge 'Fix `client_options` docs' from Karol Baryła https://github.com/scylladb/scylladb/pull/25746 added a new column to `system.clients`: `client_options frozen<map<text, text>>`. This column stores all options sent by the client in the `STARTUP` message. This PR also added `CLIENT_OPTIONS` to the list of values sent in `SUPPORTED` message, and documented that drivers can send their configuration (as JSON) in `STARTUP` under this key. Documentation for the new column was not added to the description of `system.clients` table, and documentation about the new `STARTUP` key was added in `protocol-extensions.md`, but in the section about shard awareness extension. This PR adds missing `system.clients` column description, moves the documentation of `CLIENT_OPTIONS` into its own section, and expands it a bit. Backport: none, because this fixes internal documentation. Closes scylladb/scylladb#28126 * github.com:scylladb/scylladb: protocol-extensions.md: Fix client_options docs system_keyspace.md: Add client_options column system_keyspace.md: Fix order in system.clients	2026-02-20 14:23:34 +02:00
Marcin Maliszkiewicz	22c3d8d609	Merge 'db/config: enable table audit by default' from Piotr Smaron In https://github.com/scylladb/scylladb/pull/27262 table audit has been re-enabled by default in `scylla.yaml`, logging certain categories to a table, which should make new Scylla deployments have audit enabled. Now, in the next release, we also want to enable audit in `db/config.cc`, which should enable audit for all deployments, which don't explicitly configure audit otherwise in `scylla.yaml` (or via cmd line). BTW. Because this commit aligns audit's default config values in `db/config.cc` to those of `scylla.yaml`, `docs/reference/configuration-parameters.rst`, which is based on `db/config.cc` will start showing that table audit is the default. Refs: https://github.com/scylladb/scylladb/issues/28355 Refs: https://scylladb.atlassian.net/browse/SCYLLADB-222 No backport: table audit has been enabled in 2026.1 in `scylla.yaml`, and should be always on starting from the next release, which is the release we're currently merging to (2026.2). Closes scylladb/scylladb#28376 * github.com:scylladb/scylladb: docs: decommission: note audit ks may require ALTERing docs: mention table audit enabled by default audit: disable DDL by default db/config: enable table audit by default test/cluster: fix `test_table_desc_read_barrier` assertion test/cluster: adjust audit in tests involving decommissioning its ks audit_test: fix incorrect config in `test_audit_type_none`	2026-02-19 16:30:11 +01:00
Piotr Smaron	797c5cd401	docs: decommission: note audit ks may require ALTERing With audit feature enabled, it's not immediately obvious that its pseudo-system keyspace `audit` may require adjusting its RF across DCs before decommissioning a node, and this should be documented.	2026-02-18 15:14:57 +01:00
Piotr Smaron	65eec6d8e7	docs: mention table audit enabled by default Also align the documentation with the current audit settings.	2026-02-18 15:14:57 +01:00
Andrzej Jackowski	4221d9bbfd	docs: improve examples in `Handling Audit Failures` section This commit introduces four changes: - In the `table` example, singular forms (node, partition) are changed to plural forms (nodes, partitions). Currently, the default `table` audit configuration is RF=3 and writes use CL=ONE. Therefore, a `table` audit log write failure should not be caused by a single node unavailability, and plural forms are more adequate. - In the `table` example, unreachability due to network issues is mentioned because with RF=3, audit failure due to network problems is more likely to happen than a simultaneous failure of three nodes (such network failures happened in SCYLLADB-706). - In the `syslog` example, a slash `/` is changed to `or`, so `table` and `syslog` examples have similar structure. - As the `syslog` line is already being changed, I also change `unix` to `Unix`, as the capitalized form is the correct one. Refs SCYLLADB-706 Closes scylladb/scylladb#28702	2026-02-18 13:10:01 +01:00
Marcin Maliszkiewicz	a059798de9	docs: conf: update permissions cache documentation	2026-02-17 18:18:40 +01:00
Marcin Maliszkiewicz	9d9184e5b7	auth: use unified cache for permissions	2026-02-17 17:56:27 +01:00
Anna Stuchlik	dc8f7c9d62	doc: replace the OS Support page with a link to the new location We've moved that page to another place; see https://github.com/scylladb/scylladb/issues/28561. This commit replaces the page with the link to the new location and adds a redirection. Fixes https://github.com/scylladb/scylladb/issues/28561 Closes scylladb/scylladb#28562	2026-02-06 11:38:21 +02:00
Anna Stuchlik	a427ad3bf9	doc: remove the link to the Open Source blog post Fixes https://github.com/scylladb/scylladb/issues/28486 Closes scylladb/scylladb#28518	2026-02-03 14:15:16 +01:00
Anna Stuchlik	77480c9d8f	doc: fix the links on the repair-related pages This is a follow-up to https://github.com/scylladb/scylladb/pull/28199. This commit fixes the syntax of the internal links. Fixes https://github.com/scylladb/scylladb/issues/28486 Closes scylladb/scylladb#28487	2026-02-03 06:54:08 +02:00
Avi Kivity	cc03f5c89d	cql3: support literals and bind variables in selectors Add support for literals in the SELECT clause. This allows SELECT fn(column, 4) or SELECT fn(column, ?). Note, "SELECT 7 FROM tab" becomes valid in the grammar, but is still not accepted because of failed type inference - we cannot infer the type of 7, and don't have a favored type for literals (like C favors int). We might relax this later. In the WHERE clause, and Cassandra in the SELECT clause, type hints can also resolve type ambiguity: (bigint)7 or (text)?. But this is deferred to a later patch. A few changes to the grammar are needed on top of adding a `value` alternative to `unaliasedSelector`: - vectorSimilarityArg gained access to `value` via `unaliasedSelector`, so it loses that alternate to avoid ambiguity. We may drop `vectorSimilarityArg` later. - COUNT(1) became ambiguous via the general function path (since function arguments can now be literals), so we remove this case from the COUNT special cases, remaining with count(*). - SELECT JSON and SELECT DISTINCT became "ambiguous enough" for ANTLR to complain, though as far as I can tell `value` does not add real ambiguity. The solution is to commit early (via "=>") to a parsing path. Due to the loss of count(1) recognition in the parser, we have to special-case it in prepare. We may relax it to count any expression later, like modern Cassandra and SQL. Testing is awkward because of the type inference problem in top-level. We test via the set_intersection() function and via lua functions. Example: ``` cqlsh> CREATE FUNCTION ks.sum(a int, b int) RETURNS NULL ON NULL INPUT RETURNS int LANGUAGE LUA AS 'return a + b'; cqlsh> SELECT ks.sum(1, 2) FROM system.local; ks.sum(1, 2) -------------- 3 (1 rows) cqlsh> ``` (There are no suitable system functions!) Fixes https://scylladb.atlassian.net/browse/SCYLLADB-296 Closes scylladb/scylladb#28256	2026-02-02 00:06:13 +02:00
Pawel Pery	f49c9e896a	vector_search: allow full secondary indexes syntax while creating the vector index Vector Search feature needs to support creating vector indexes with additional filtering column. There will be two types of indexes: global which indexes vectors per table, and local which indexes vectors per partition key. The new syntaxes are based on ScyllaDB's Global Secondary Index and Local Secondary Index. Vector indexes don't use secondary indexes functionalities in any way - all indexing, filtering and processing data will be done on Vector Store side. This patch allows creating vector indexes using this CQL syntax: ``` CREATE TABLE IF NOT EXISTS cycling.comments_vs ( commenter text, comment text, comment_vector VECTOR <FLOAT, 5>, created_at timestamp, discussion_board_id int, country text, lang text, PRIMARY KEY ((commenter, discussion_board_id), created_at) ); CREATE CUSTOM INDEX IF NOT EXISTS global_ann_index ON cycling.comments_vs(comment_vector, country, lang) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'DOT_PRODUCT' }; CREATE CUSTOM INDEX IF NOT EXISTS local_ann_index ON cycling.comments_vs((commenter, discussion_board_id), comment_vector, country, lang) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'DOT_PRODUCT' }; ``` Currently, if we run these queries to create indexes we will receive such errors: ``` InvalidRequest: Error from server: code=2200 [Invalid query] message="Vector index can only be created on a single column" InvalidRequest: Error from server: code=2200 [Invalid query] message="Local index definition must contain full partition key only. Redundant column: XYZ" ``` This commit refactors `vector_index::check_target` to correctly validate columns building the index. Vector-store currently support filtering by native types, so the type of columns is checked. The first column from the list must be a vector (to build index based on these vectors), so it is also checked. Allowed types for columns are native types without counter (it is not possible to create a table with counter and vector) and without duration (it is not possible to correctly compare durations, this type is even not allowed in secondary indexes). This commits adds cqlpy test to check errors while creating indexes. Fixes: SCYLLADB-298 This needs to be backported to version 2026.1 as this is a fix for filtering support. Closes scylladb/scylladb#28366	2026-01-30 01:14:31 +02:00
Jakub Smolar	e978cc2a80	scylla_gdb: use persistent GDB - decrease test execution time This commit replaces the previous approach of running pytest inside GDB’s Python interpreter. Instead, tests are executed by driving a persistent GDB process externally using pexpect. - pexpect: Python library for controlling interactive programs (used here to send commands to GDB and capture its output) - persistent GDB: keep one GDB session alive across multiple tests instead of starting a new process for each test Tests can now be executed via `./test.py gdb` or with `pytest test/scylla_gdb`. This improves performance and makes failures easier to debug since pytest no longer runs hidden inside GDB subprocesses. Closes scylladb/scylladb#24804	2026-01-29 10:01:39 +02:00
Botond Dénes	482ffe06fd	Merge 'Improve load shedding on the replica side' from Łukasz Paszkowski When reads arrive, they have to wait for admission on the reader concurrency semaphore. If the node is overloaded, the reads will be queued. They can time out while in the queue, but will not time out once admitted. Once the shard is sufficiently loaded, it is possible that most queued reads will time out, because the average time it takes to for a queued read to be admitted is around that of the timeout. If a read times out, any work we already did, or are about to do on it is wasted effort. Therefore, the patch tries to prevent it by checking if an admitted read has a chance to complete in time and abort it if not. It uses the following criteria: if read's remaining time <= read's timeout when arrived to the semaphore * live updateable preemptive_abort_factor; the read is rejected and the next one from the wait list is considered. Fixes https://github.com/scylladb/scylladb/issues/14909 Fixes: SCYLLADB-353 Backport is not needed. Better to first observe its impact. Closes scylladb/scylladb#21649 * github.com:scylladb/scylladb: reader_concurrency_semaphore: Check during admission if read may timeout permit_reader::impl: Replace break with return after evicting inactive permit on timeout reader_concurrency_semaphore: Add preemptive_abort_factor to constructors config: Add parameters to control reads' preemptive_abort_factor permit_reader: Add a new state: preemptive_aborted reader_concurrency_semaphore: validate waiters counter when dequeueing a waiting permit reader_concurrency_semaphore: Remove cpu_concurrency's default value	2026-01-29 08:27:22 +02:00
Marcin Maliszkiewicz	5d4e2ec522	Merge 'docs: add documentation for automatic repair' from Botond Dénes Explain what automatic repair is and how to configure it. While at it, improve the existing repair documentation a bit. Fixes: SCYLLADB-130 This PR missed the 2026.1 branch date, so it needs backport to 2026.1, where the auto repair feature debuts. Closes scylladb/scylladb#28199 * github.com:scylladb/scylladb: docs: add feature page for automatic repair docs: inter-link incremental-repair and repair documents docs: incremental-repair: fix curl example	2026-01-28 17:46:53 +01:00
Botond Dénes	1713d75c0d	docs: add feature page for automatic repair Explain what the feature is and how to confiture it. Inter-link all the repair related pages, so one can discover all about repair, regardless of which page they land on.	2026-01-28 16:45:57 +02:00
Łukasz Paszkowski	2d3a40e023	permit_reader: Add a new state: preemptive_aborted A permit gets into the preemptive_aborted state when: - times out; - gets rejected from execution due to high chance its execution would not finalize on time; Being in this state means a permit was removed from the wait list, its internal timer was canceled and semaphore's statistic `total_reads_shed_due_to_overload` increased.	2026-01-28 14:20:01 +01:00
Botond Dénes	7ac32097da	docs/cql/ddl.rst: Tombstones GC: explain propagation delay This parameter was not mentioned at all anywhere in the documentation. Add an explanation of this parameter: why we need it, what is the default and how it can be changed. Closes scylladb/scylladb#28132	2026-01-27 16:05:52 +01:00
Anna Stuchlik	edc291961b	doc: update the GPG keys Update the keys in the installation instructions (linux packages). Fixes https://github.com/scylladb/scylladb/issues/28330 Closes scylladb/scylladb#28357	2026-01-26 19:13:18 +02:00
Avi Kivity	32cc593558	Merge 'tools/scylla-sstable: introduce filter command' from Botond Dénes Filter the content of sstable(s), including or excluding the specified partitions. Partitions can be provided on the command line via `--partition`, or in a file via `--partitions-file`. Produces one output sstable per input sstable -- if the filter selects at least one partition in the respective input sstable. Output sstables are placed in the path provided via `--oputput-dir`. Use `--merge` to filter all input sstables combined, producing one output sstable. Fixes: #13076 New functionality, no backport. Closes scylladb/scylladb#27836 * github.com:scylladb/scylladb: tools/scylla-sstable: introduce filter command tools/scylla-sstable: remove --unsafe-accept-nonempty-output-dir tools/scylla-sstable: make partition_set ordered tools/scylla-stable: remove unused boost/algorithm/string.hpp include	2026-01-26 16:32:38 +02:00

1 2 3 4 5 ...

2060 Commits